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Abstract 
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Chapter  1 

Introduction 

1.1  Unreliable  Networks 

By  a  data-communication  network  we  mean  a  collection  of  geographically  distant 
computers  (nodes)  that  can  communicate  with  each  other  by  exchanging  messages 
via  communication  channels  (links).  From  our  point  of  view  the  purpose  of  a  data- 
communication  network  is  to  transport  messages  reliably  between  its  component 
nodes.  Most  conventional  wire-networks  have  highly  reliable  links.  In  this  kind  of 
network  the  usual  approach  to  reliable  message  transfer  is  to  include  in  each  mes¬ 
sage  enough  redundancy  to  make  error -detection  possible.  Typically  the  polynomial 
code,  also  known  as  cyclic  redundancy  code  or  CRC,  is  used.  This  code  satisfies 
the  standard  reliability-enhancing  requirements,  and  is  easily  implementable  in 
hardware,  see  [17]  for  more  details.  Upon  detection  of  an  error  most  link-level 
transmission  protocols  request  that  the  message  be  re-sent.  Typical  of  this  class  of 


protocols  is  HDLC  (High-level  Data  Link  Control)  used  in  CCITT’s  X.25  network 
interface  standard.  The  net  effect  of  an  error  is  to  add  to  the  transmission  delay, 
since  one  has  to  wait  until  the  message  is  retransmitted.  Also,  the  retransmitted 
copy  of  the  message  shows  up  as  an  increase  in  the  total  load  of  the  network.  If 
the  bit-error  rate  were  to  increase,  the  transmission  delay  might  become  unsat¬ 
isfactory.  In  this  contingency  we  need  to  seek  different  means  to  achieve  reliable 
communication.  There  are  other  types  of  network,  e.g.  packet  radio  networks,  that 
are  inherently  unreliable  and  would  benefit  from  techniques  designed  to  improve 
message  transmission  reliability. 

In  this  thesis  we  focus  on  approaches  that  use  two  or  more  transmission  paths 
simultaneously  to  transfer  a  message.  We  intend  to  trade  increased  use  of  network 
resources,  two  or  more  paths  instead  of  one,  for  a  lowered  transmission  delay. 
We  can  argue  in  favor  of  this  approach,  with  its  consequent  reduction  in  total 
throughput,  by  saying  that  when  using  the  more  conventional  approach  an  error 
results  in  the  repeated  use  of  the  same  path  for  the  same  message.  Thus,  with  a 
‘high’  bit  error  rate,  in  a  network  that  uses  the  standard  approach,  it  is  likely  that, 
in  the  end,  we  will  use  extra  resources  anyhow. 

The  extreme  case  of  multiple  path  transmission  is  flooding  the  network  with 
copies  of  the  message.  The  message  is  routed  so  that,  in  the  case  of  no  failures, 
a  copy  of  the  message  is  sent  along  every  link  of  the  network.  This  is  maximum 
network  resource  consumption.  We  are  looking  for  intermediate  strategies  between 
the  extremes  of  sending  a  single  copy  of  the  message  and  flooding  the  network  with 
many  copies  of  it. 


The  choice  of  the  specific  approach  to  be  used  depends  on  exactly  how  link- 
failures  behave.  When  we  talk  about  link-failures,  we  do  not  necessarily  mean 
the  actual  misfunction  of  the  hardware,  but  also  any  condition  that  results  in  an 
unsuccessful  transmission.  Here  we  are  mostly  concerned  about  the  correlation 
between  the  outcomes,  whether  failures  or  successes,  of  consecutive  attempts  at 
transmission.  In  other  words,  if  a  link  fails,  for  how  long  does  it  stay  ‘down’?  In 
this  thesis  we  assume  that  the  status  of  a  link,  i.e.  whether  ‘up’  or  ‘down’,  changes 
so  fast  that  keeping  track  of  it  would  be  pointless.  The  delay  formed  by  the  time 
it  takes  to  determine  a  link’s  status  plus  the  time  necessary  to  disseminate  that 
datum  is  sufficiently  large  that  the  received  information  is  already  obsolete.  This 
is  because  the  information  in  question  is  received  from  the  network  itself. 

A  packet  radio  network  with  mobile  nodes  fits  these  characteristics.  In  this 
scenario,  the  mobility  of  the  transmitters  and  the  fragility  of  the  medium  conspire 
to  produce  what  we  can  safely  call  an  ‘unreliable’  network.  More  so,  if  we  consider 
the  possibility  that  the  network  might  find  itself  under  attack,  an  external  agency 
might  try  to  prevent  successful  communication,  by  jamming,  say. 

We  next  mention  some  approaches  used  to  deal  with  node  and  link  failures  in 
the  more  usually  considered  case  when  the  network  state  remains  stable  for  long 
enough  to  make  possible  the  use  of  knowledge  about  actual  network  configuration. 
Strategies  proposed  range  from  the  very  dynamic  to  the  completely  static.  An 
example  of  the  former  type  is  the  one  used  in  the  ARPANET  in  which  traffic  is 
rerouted  in  an  attempt  to  use  low  delay  routes  and  avoid  links  with  long  queueing 
delays.  An  example  of  the  latter  is  the  one  proposed  for  the  IBM  network  in  which 
for  each  source-destination  pair  there  is  a  list  of  allowed  routes.  When  a  link  failure 
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is  detected  all  routes  utilizing  this  link  are  deleted  from  the  allowed  route  lists. 
S.  G.  Finn  proposes  in  [4]  a  class  of  network  synchronization  procedures,  called 
Resynch  Procedures.  These  are  mechanism  for  bringing  all  nodes  of  a  distributed 
network  to  a  known  state  simultaneously,  despite  arbitrary  finite  delays  between 
nodes.  The  resynch  procedures  are  used  to  implement  a  network  protocol  that  can 
guarantee  that  no  packets  will  be  lost  and  no  duplicate  packets  will  be  inadvertently 
received,  despite  arbitrary  node  and  link  failures.  J.  A.  Roskind  in  his  doctoral 
thesis,  see  [13',  proposes  the  use  of  precomputed  disjoint  spanning  trees  to  transmit 
information  about  topological  changes.  J.M.  Spinelli  in  his  Master’s  thesis,  see  [15], 
presents  an  algorithm  to  maintain  a  correct  view  of  the  network  topology  without 
transmitting  any  information  other  than  the  operational  status  of  links. 

This  thesis  examines  some  approaches  for  choosing  routes  for  two  copies  of  a 
message.  We  focus  on  maximizing  the  probability  that  at  least  one  of  the  copies 
arrives  at  the  destination  node.  We  ask  how  hard  is  it  to  calculate  these  routes. 
We  conclude  that  looking  for  a  pair  of  routes  that  satisfy  this  very  natural  criterion 
is  a  computationally  intractable  problem. 

The  rest  of  this  chapter  is  as  follows.  In  section  1.2  we  define  probabilistic 
networks.  This  is  the  basic  model  used  in  this  thesis.  In  section  1.3  we  give  an 
overview  of  related  work  to  be  found  in  the  literature.  In  section  1.4  we  consider 
alternative  ways  to  use  several  paths  simultaneously  for  transmitting  one  message. 
Section  1.5  gives  an  overview  of  this  thesis  including  a  summary  of  results. 


1.2  Probabilistic  Networks 


A  directed  network  is  a  tuple  (N,  L) ,  where  N  is  a  finite  set  of  nodes  and  L  C  N  x  N 
is  the  set  of  links  of  the  network.  Notice  that  this  definition  excludes  the  existence 
of  more  than  one  link  between  any  two  nodes.  A  link  of  the  network  is  denoted 
either  by  a  single  letter,  e.g.  /,  or  by  the  ordered  pair  of  nodes  connected  by  the 
link,  e.g.  (t,j>),  we  assume  that  the  network  includes  no  self-loops,  i.e.  there  is  no 
link  of  the  form  (t,  t)  in  the  network.  In  the  links  of  a  directed  network,  traffic  can 
flow  only  in  the  direction  given  by  the  ordering  of  the  nodes,  i.e.  in  link  (i,j)  we 
can  have  traffic  going  from  node  i  to  node  j ,  but  not  in  the  opposite  direction.  The 
network  includes  two  distinguished  nodes,  nodes  s  and  t.  Node  s  is  the  origin  node, 
node  t  the  destination.  In  a  similar  manner  we  can  define  an  undirected  network 
in  such  a  network  a  link  is  an  unordered  pair,  e.g.  (i,j),  traffic  can  flow  in  both 
directions  of  an  undirected  link.  In  what  follows  we  shall  be  referring  to  directed 
networks  unless  explicitly  stated. 

A  simple  path  joining  nodes  s  and  t  is  a  sequence  of  nodes  s,  n^n 2, . . . ,  nm,t, 
such  that  the  links  (s,  nj);  (n,-,  n,+1)  for  i  =  1, . . . ,  m  —  1;  and  (nm,  t)  are  all  members 
of  L,  and  such  that  no  node  in  this  sequence  is  repeated.  I.e.  the  path  does  not 
fold  onto  itself. 

To  each  link  l  G  L  we  associate  a  rational  number  0  <  pi  <  1,  the  probability 
that  link  /  is  operative.  Links  fail  independently  of  each  other.  This  independence 
assumption  is  unrealistic  but  necessary,  we  shall  see  that  even  with  its  inclusion 
many  questions  pertaining  to  probabilistic  networks  are  very  difficult  to  answer. 
If  we  give  up  this  assumption  it  can  be  shown  that  finding  one  simple  path,  P, 


Figure  1.1:  Transforming  an  Unreliable  Node 


connecting  nodes  s  and  t  in  a  given  probabilistic  network  such  that  the  probability 
that  P  is  operative  is  greater  or  equal  than  a  given  rational  threshold  is  an  NP~ 
complete  problem.  An  intermediate  approach  between  assuming  independence 
or  assumming  unrestricted  dependencies  would  be  to  consider  locally  correlated 
failures,  i.e.  if  a  link  has  failed  it  is  likely  that  ‘nearby’  links  have  failed  too.  We 
have  not  pursued  this.  We  shall  assume  that  the  nodes  of  the  network  are  perfectly 
reliable.  This  implies  no  loss  of  generality  because  any  unreliability  in  the  nodes 
of  the  network  can  be  modelled  by  unreliable  links.  To  wit,  see  figure  1.1  given 
an  unreliable  node  n,  and  given  pn,  the  probability  that  node  n  is  operative,  one 
can  separate  node  n  into  two  nodes,  n'  and  n",  joined  by  a  link  l  with  operation 
probability  equal  to  pn.  Make  all  links  incoming  into  n  arrive  at  n',  and  all  outgoing 
links  depart  n" .  The  two  versions  are  equivalent. 

A  path  P  connecting  nodes  s  and  t  is  said  to  be  operative  if  every  link  l  £  P 
is  operative.  Given  a  path  P,  P{P)  the  probability  that  this  path  is  operative  is 


(1.1) 


P(P)  =  [[PI- 

ieP 

We  consider  only  simple  paths  because  for  any  non-simple  path  P,  a  path  that 
loops  onto  itself,  there  exists  a  simple  path  Pa  C  P  such  that  P{P,  )  >  P[P). 
The  main  questions  concerning  probabilistic  networks  deal  with  the  existence  of 
operative  simple  s-t-paths.  We  next  give  an  overview  of  these  questions  and  the 
related  results  to  be  found  in  the  literature. 


1.3  Related  Problems 

The  problem  of  determining  “network  reliability”,  i.e.  the  probability  that  there 
is  a  path  connecting  node  s  to  every  other  node  in  the  network  has  been  proved 
to  be  #P-complete,  to  be  defined  below,  also  see  [  1 1] .  However,  if  the  graph  is  di¬ 
rected  and  acyclic,  network  reliability  can  be  computed  in  polynomial  time,  see  [l]. 
The  #P-completeness  result  holds  for  directed  and  undirected  networks.  Approx¬ 
imating  the  reliability  to  a  given  e  >  0  is  also  a  #P-complete  problem,  see  [11]. 
The  category  of  #P-complete  (read  “number  P”)  problems  was  proposed  in  [18]. 
It  includes  many  enumeration  problems,  i.e.  “given  a  problem,  how  many  solu¬ 
tions  are  there?”.  For  example,  associated  with  the  Satisfiability  problem  is  the 
enumeration  problem  of  counting  how  many  truth  assignments  satisfy  simultane¬ 
ously  the  clauses  of  the  problem.  This  category  of  problems  is  ‘harder’  than  ‘plain’ 
NP- complete  problems.  It  is  interesting  that  there  are  #P-complete  problems  for 
which  the  associated  search  problem  can  be  solved  in  polynomial  time.  For  more 


on  #P-completeness  see  [6). 


Similar  results  exist  for  the  “ s-t  reliability”  problem  where  we  are  given  two 
nodes  s  and  t  and  asked  what  is  the  probability  that  there  is  an  operative  path 
between  them.  Valiant  has  proven  that  this  problem  is  #P-complete  for  general 
directed  or  undirected  graphs,  see  [19].  There  are  algorithms  to  calculate  s-t  re¬ 
liability  in  time  proportional  with  |iV|  and  the  number  of  minimal  s-t  cuts  in  G. 
This  number  can  be  very  large,  exponential  in  |iV|,  but  need  not  be  so,  see  [12]. 

The  related  problem  of  finding  a  most  reliable  path  in  a  probabilistic  network 
has  a  polynomial  time  algorithm.  The  algorithm  is  equivalent  to  calculating  a 
shortest  path  between  the  nodes  s  and  t  with  the  length  of  a  link  given  by  —  log  pi, 
see  [9]. 


1.4  Alternative  Approaches 


Given  the  basic  idea  of  using  multiple  paths,  the  actual  implementation  can  be 
done  in  many  different  ways.  The  main  question  is,  how  are  the  paths  going  to 
be  chosen?  A  related  issue  is  how  to  characterize  the  reliability  of  the  links  of  the 
network. 

Given  two  nodes,  once  we  have  chosen  several  paths  that  connect  them,  these 
paths  can  be  used  in  different  ways.  The  simplest  approach  would  be  to  send 
identical  copies  of  the  message  we  want  to  transmit  along  each  one  of  the  selected 
paths.  Alternatively,  we  could  map  the  message,  by  a  suitable  coding  scheme, 
into  a  relatively  large  number  of  ‘micro-packets’.  The  coding  would  ensure  that 
the  recovery  of  a  given  fraction  of  these  //-packets,  say  eighty  of  a  hundred,  would 
enable  us  to  reconstruct  the  original  message.  Here  we  shall  call  such  a  coding 
scheme  a  ‘holographic’  code.  Each  path  would  carry  some  fraction  of  the  //-packets. 
The  former  procedure,  identical  copies,  would  be  preferable  in  a  situation  in  which 
there  is  some  correlation  between  the  failure  or  succes  of  succesive  attempts  at 
transmission  along  network  links.  The  latter  seems  r.  ore  suited  when  we  can 
assume  that  the  outcome  of  these  trials  are  independent  random  variables.  In  the 
rest  of  this  section,  we  go  briefly  over  alternative  approaches  suggested  by  the  basic 
multiple-path  transmission  idea. 
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1.4.1  Modeling  Reliability  with  Probabilistic  Networks 


The  most  common  way  to  quantify  the  reliability  of  a  link  is  to  associate  with 
it  a  probability.  The  most  basic  problem  this  setup  suggests  is  to  look  for  two 
paths,  Pi  and  P2,  such  that  the  probability  that  at  least  one  of  them  is  operative 
is  maximized.  That  is,  we  are  trying  to  maximize 

n  pi+  n  pi-  n  *><■ 

{llePl}  {l:iePiUP2} 

We  obtain  variants  of  this  problem  by  including  further  restrictions  on  the  paths, 
e.g.  we  could  ask  that  Pi  and  P2  should  be  disjoint,  i.e.  they  should  have  no  links 
in  common.  In  the  same  vein,  one  could  look  for  disjoint  paths  Pi  and  P2  such 
that  the  probability  that  both  of  them  are  operative  is  maximized.  That  is,  we  are 
maximizing 

n  pi- 

{uePiUPi} 

This  last  formulation  has  the  advantage  that,  after  the  appropriate  transformation, 
the  problem  can  be  solved  by  existing  special  purpose  efficient  algorithms,  see  [l6i. 
The  problem  is  transformed  into  a  minimum  cost  flow  problem. 

A  different  approach  is  to  fix  one  path  Pi  and  look  for  a  path  P2,  not  necessarily 
disjoint  from  the  first,  such  that  the  probability  that  at  least  one  of  the  two  paths 
is  operative  is  maximized.  We  say  that  P2  complements  Pj.  This  procedure  can  be 
iterated;  after  finding  P2,  take  it  as  the  fixed  path  and  then  find  P3  complementing 
P2.  This  can  be  repeated  until  we  are  satisfied  with  the  answer,  or  until  we  feel 
that  the  progress  being  made  is  not  worth  the  effort. 
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1.4.2  Modeling  Reliability  with  Additive  Costs 


An  alternative,  and  more  heuristic,  approach  is  to  assign  to  each  link  /  coefficients 
c,\  cf,  ...,  cj71,  where  m  is  the  number  of  paths  Px,  P2,  ...,  Pm,  that  we  are  looking 
for.  The  number  £’=1  c\  corresponds  to  the  cost  incurred  when  i  paths  use  link  l. 
The  c|  are  chosen  to  reflect  the  reliability  of  link  /;  the  more  unreliable  a  link,  the 
higher  the  cost  associated  with  its  use.  We  want  paths  Pi,  . . . ,  Pm,  that  minimize 
the  total  cost.  That  is,  m  paths  minimizing 

«(<) 

zzci. 

i  j=i 

Here  i(/)  is  the  number  of  paths  that  use  node  l. 

If  we  define  c}  =  -  lnpi,  then  a  set  of  minimum  cost  paths  P,  that  happen  to 
be  disjoint  will  maximize  the  probability  that  all  the  P,  are  operative. 

If  the  cj  satisfy  0  <  c}  <  cf  <  ...  <  cj71,  the  optimum  can  be  found  by  casting 
the  problem  as  a  minimum  cost  flow  problem.  To  each  link  l  =  (i,j)  correspond 
m  parallel  links,  i.e.  these  m  links  go  between  nodes  i  and  j,  P,  P,  . . . ,  /m,  with 
associated  costs  cj,  cf,  ...,  cj71  and  unit  capacity.  We  seek  a  minimum  cost  flow 
of  m  units  between  nodes  s  and  t.  Viewing  the  problem  in  these  terms  allows  us 
to  use  special  purpose  efficient  algorithms.  Also,  the  matrix  associated  with  the 
capacity  and  flow  conservation  constraints  of  this  problem  is  unimodular.  This 
guarantees  that  if  the  problem  is  feasible  there  will  be  an  optimal  integer  solution. 


1.4.3  Interpreting  the  Probabilites  as  Loss  Coefficients 

Another  approach  is  to  interpret  the  probabilities  pi  as  ‘transmission  coefficients’. 
That  is,  if  a  link  l  has  the  entering  flow  of  /;in,  the  flow  coming  out  would  be  given 
by  /i°ut  =  Pi//n-  To  each  link  we  assign  unit  capacity.  Given  that  there  are  m 
units  of  flow  entering  s,  we  want  to  find  a  feasible  flow  that  will  maximize  the  flow 
entering  node  t.  A  feasible  flow  is  such  that  for  each  link  /,  //n  <  1,  and  for  each 
node  n  the  flow  entering  it  is  equal  to  the  flow  coming  out,  i.e. 

E  p</.in=  £  //"•  (1.2) 

{ri  enters  n}  {u  leaves  n} 

This  specifies  a  linear  program.  Notice  that  the  solution  to  this  problem  is  not 
guaranteed  to  be  integer.  The  expression 


ai 


r 

leaves  n}  h m 


(1.3) 


can  be  interpreted  as  a  relative  routing  variable  for  outgoing  link  /.  That  is,  of  all 
the  flow  entering  node  n  a  fraction  of  it  given  by  cq  is  routed  along  link  /. 

This  scheme  is  compatible  with  the  ‘holographic’  coding,  since  the  subdivision 
of  a  message  into  //-packets  is  better  adapted  to  the  fractional  routing  described.  In 
this  case  the  capacity  of  a  link  would  be  set  to  the  number  of  //-packets  a  message 
is  mapped  into. 

From  a  probabilistic  point  of  view  this  approach  is  equivalent  to  maximizing 


the  expected  number  of  //-packets  arriving  at  the  destination  node. 


We  can  observe  that  in  a  scheme  as  the  above,  maximum  flow  in  a  network  with 
losses  plus  ‘holographic’  coding,  each  //-packet  should  contain  enough  redundancy 
to  allow  the  detection  of  transmission  errors.  The  link  protocol  would  most  likely 
be  designed  not  to  ask  for  //-packet  retransmission  when  a  link  transmission  error 
is  detected;  rather  retransmission  wouid  be  provided  on  an  end  to  end  basis  using 
time-outs. 


1.5  Summary  of  Results 

The  underlying  question  we  are  addressing  is:  what  can  be  done  when  we  do  not 
really  know  what  is  happening  in  the  network?  Most  published  routing  algorithms 
assume  accurate  knowledge  of  the  conditions  in  the  network:  topology,  and  traffic 
matrices  are  taken  as  known;  the  message  arrival  processes  are  often  considered  to 
be  Poisson,  and  so  on.  In  practice  the  situation  is  different.  We  need  ways  to  deal 
with  data-communication  networks  when  we  do  not  have  exact  knowledge  of  their 
status.  We  need  to  decide  how  to  represent  the  uncertainty:  one  could,  for  example, 
assume  a  probability  density  on  the  inputs,  or  one  could  take  that  it  is  only  known 
that  the  inputs  are  inside  a  given  range.  Corresponding  to  the  former  approach, 
probabilistic  weighting  of  inputs,  the  logical  next  step  would  be  to  optimize  the 
expected  value  of  an  appropiate  objective  function,  including,  perhaps,  restrictions 
on  variance.  Corresponding  to  the  latter  approach,  the  input  within  a  range,  a 
min-max  optimization  could  be  attempted. 

This  thesis  is  an  exploratory  foray  in  the  direction  expressed  above.  We  look 
at  a  basic  problem:  what  to  do  when  we  do  not  know  the  topology  of  the  net- 
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work.  More  precisely,  what  can  be  done  when  our  knowledge  about  any  link  of  the 
network  is  the  probability  that  this  link  is  operative.  We  are  looking  for  a  robust 
routing  scheme,  i.e.  a  scheme  that  would  produce  satisfactory  results  in  the  face 
of  uncertainty.  The  essence  of  most  attempts  at  designing  robust  systems,  as  for 
example  in  control  theory,  is  that  we  need  to  overdesign.  We  need  to  know  how  to 
overdesign  intelligently,  i.e.  how  to  get  the  most,  where  the  meaning  of  ‘most’  has 
yet  to  be  agreed  upon,  from  the  additional  expenses. 

In  the  rest  of  this  section  we  outline  the  rest  of  this  thesis. 

Chapter  Two:  Complementary  Path  Problem 

The  problem  considered  in  this  chapter  is  as  follows.  Given  nodes  s  and  t,  given  Px, 
a  path  that  connects  these  two  nodes;  find  P2,  a  path  also  connecting  s  and  t,  not 
necessarily  disjoint  from  Pi,  such  that  the  probability  that  at  least  one  of  these  two 
paths  is  operative  is  maximized.  We  prove  that  for  directed  or  undirected  networks 
this  problem  is  iVP-complete.  The  proof  is  a  reduction  to  Partition,  and  applies  to 
the  restricted  case  of  directed  acyclic  networks.  We  also  give  a  pseudo-polynomial 
algorithm  for  the  solution  of  this  problem. 

Chapter  Three:  Routing  Two  Messages  Along  Disjoint  Paths 

We  consider  the  following  problem.  Given  nodes  s  and  t,  find  Pi  and  P2,  two 
disjoint  paths  joining  nodes  s  and  t,  such  that  the  probability  that  at  least  one  of 
the  two  paths  is  operative  is  maximized.  We  show  that  this  problem  is  ^VP-complete 


for  general  directed  networks.  The  problem  is  shown  to  be  strongly  NP- complete 
if  we  allow  the  possibility  pi  =  1,  or  if  the  probabilities  are  given  in  logarithmic 
format,  i.e.  p;  =  an‘  for  rational  0  <  a  <  1  and  positive  integers  n,j.  We  give  a 
pseudo-polynomial  algorithm  for  acyclic  networks.  It  is  not  known  if  the  problem 
is  iVP-complete  for  acyclic  graphs.  The  related  problem  of  finding  Pi  and  P2,  two 
disjoints  paths  joinng  nodes  s  and  t,  such  that  the  expected  number  of  operative 
paths  is  maximized  is  also  shown  to  be  NP- complete. 

Chapter  Four:  Routing  Two  Messages  Along  Non-disjoint 
Paths 

We  address  the  same  problem  considered  in  chapter  three,  with  the  difference  that 
the  path-disjointedness  restriction  is  dispensed  with.  The  problem  is  shown  to  be 
iVP-complete,  even  in  the  case  of  an  acyclic  network.  We  give  a  pseudo-polynomial 
algorithm  for  acyclic  networks.  It  is  not  known  if  the  case  of  a  general  network,  a 
network  that  might  contain  cycles,  results  in  a  strongly  iVP-complete  problem. 


The  fifth  chapter  contains  the  conclusions  and  suggestions  for  further  research. 


Chapter  2 


Finding  a  Complementary  Path 

2.1  Introduction 

Onee  approach  to  routing  two  copies  of  a  message  in  a  data-communication  network 
is  to  fix  the  route  of  one  of  the  two  copies.  The  second  copy  is  then  routed  to 
optimize  a  given  criterion  that  is  a  function  of  both  copies’s  routes.  In  this  thesis 
the  criterion  to  be  optimized,  maximized,  is  the  probability  that  at  least  one  of  the 
two  copies  reaches  their  destination.  The  first  copy  might  be  sent,  for  example, 
along  a  most  reliable  path  of  the  network,  or  through  any  chosen  path  that  the 
specific  situation  suggests.  The  paths  followed  by  these  two  copies  need  not  be 
disjoint.  If  they  are,  then  the  optimal  route  for  the  second  copy  is  a  most  reliable 
path  in  the  network  obtained  by  removing  from  the  original  network  the  links 
belonging  to  the  path  followed  by  the  first  copy. 

The  failure  of  a  link  that  is  shared  by  both  copies's  paths  is  catastrophic  but, 


Figure  2.1:  An  example  where  sharing  links  is  indicated 


as  in  the  case  of  a  very  reliable  link,  sharing  might  be  preferrable  to  using  an 
alternative  unreliable  link  also  such  a  link  need  not  exist.  The  network  of  figure  2.1 
presents  such  a  case,  the  number  labeling  each  link  in  that  figure  is  the  probability 
that  the  link  is  operative.  Given  Pi,  the  fixed  path,  as  in  the  figure,  the  optimum 
Pi,  the  path  followed  by  the  second  copy,  shares  the  link  going  between  node  s 
and  node  2.  This  link  is  much  more  reliable  than  the  only  alternative  link  that 
connects  node  s  to  node  1. 

The  problem  described  in  the  previous  paragraph  is  one  of  the  simplest  that  can 
be  proposed  under  the  idea  of  sending  multiple  copies  of  a  message.  VVe  would  like 


to  obtain  an  efficient,  i.e.  polynomial,  algorithm  for  its  solution.  Unfortunately  in 
this  chapter  we  show  by  a  reduction  to  Partition  that  this  problem  is  A'P-complete. 
Therefore  a  polynomial  algorithm  is  ruled  out  under  the  assumption  P  ^  N P .  We 
present  a  pseudo-polynomial  algorithm  to  pick  the  routing  for  the  second  copy 
given  a  fixed  route  to  be  followed  by  the  first  copy.  The  algorithm  is  an  adaptation 
of  that  given  in  [9]  to  find  a  shortest  length  path  with  time  constrains. 

In  section  2.2  the  problem  is  stated  more  precisely.  In  section  2.3  we  give  the 
iYP-completeness  proof.  In  section  2.4  we  consider  a  variation  of  the  problem  in 
which  a  different  format  is  used  to  represent  the  probabilities.  In  section  2.5  we 
obtain  an  equivalent  problem.  The  algorithm  is  developed  in  section  2.6. 


2.2  Problem  Statement 

In  this  section  we  define  precisely  the  problem  introduced  in  the  previous  section. 

Complementary  Path  Problem 

INSTANCE:  Given  G  =  ( N,L )  a  directed  graph,  where  N  is  the  set  of  nodes  of 

the  graph,  and  L  C  N  x  N  is  the  set  of  links.  For  each  link  l  £  L  we  are  given  a 
rational  number  pi,  0  <  pi  <  i,  the  probability  that  link  l  will  be  operative.  Also 
given  is  pth  a  rational  threshold,  and  Pi  C  L  a  simple  directed  path  in  G  connecting 
nodes  s  and  t. 

QUESTION:  Is  there  Pj,  another  simple  directed  path  that  connects  nodes  s  and 

t,  such  that  the  probability  that  at  least  one  of  these  two  paths  is  operative  is  larger 
than  pth7-  A  path  is  operative  if  every  link  in  that  path  is  operative.  The  paths  Pi 
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and  P2  need  not  be  disjoint.  More  precisely,  is  there  a  Po  so  that 


p(pl,p2)  =  rip(+np‘-  n  pi  (2-4) 

u£Pi  I&P2  iCPi^P2  J 


A  P2  that  maximizes  P^P^Pi)  is  said  to  complement  Pi. 


2.3  The  Complementary  Path  Problem  is  NP- 
Complete 

In  this  section  we  prove  that  the  complementary  path  probk  m  is  iVP-complete. 
This  is  done  by  giving  a  construction  that  results  in  a  reduction  to  partition.  Next 
we  define  the  partition  problem. 

Partition 

INSTANCE:  Given  5  a  finite  set  of  positive  integers,  5  =  {ci,  c2, . . . ,  cm}. 
QUESTION:  Is  there  a  subset  A  C  5  such  that 

X)  c>?  (2-5) 

i€A  ieS\A 

The  sets  A  and  S  \  A  constitute  a  partition  of  5.  A  reduction  to  partition  is 
often  used  to  show  NP-hardness  of  problems  in  whose  statement,  loosely  said,  many 
numbers,  usually  positive  integers,  appear.  For  example,  the  well  known  Knapsack 
problem  is  easily  shown  to  be  NP-hard  by  a  reduction  to  Partition  the  numbers 


appearing  when  specifying  an  instance  of  the  Knapsack  problem  are  the  weights 
and  values  of  the  elements.  Partition  is  not  a  strongly  NP- complete  problem  i.e.  it 
can  be  solved  in  pseudo-polynomial  time  by  dynamic  programming.  Therefore,  for 
any  problem  shown  to  be  iYP-hard  by  a  reduction  to  partition  it  is  possible  that 
there  exists  a  pseudo-polynomial  time  algorithm  for  its  solution.  In  fact,  this  is  the 
case  for  the  complementary  path  problem,  in  section  2.6  we  offer  such  an  algorithm. 
Partition  appeared  in  the  original  list  of  21  JVP-complete  problems  published  by 
Karp  in  1972,  see  [8]. 

2.3.1  The  Reduction 

Theorem  2.1  The  complementary  path  problem  is  NP-complete. 

Proof.  The  problem  is  easily  seen  to  be  in  NP,  i.e.  for  any  given  pair  of  paths 
Pi,P2,  and  threshold  pth,  it  can  be  checked  in  polynomial  time  whether  P(P1?P2)  > 
Pth ■  Consider  the  network  shown  in  figure  2.2,  with  Pj  as  shown.  Px  includes  all 
links  Z°,  j  =  1, . . . ,  m.  Define  p,n  =  hi,  pn  =  £,,  let  all  other  links  in  the  network  be 

J  J  3 

perfectly  reliable.  A  path  P2  connecting  node  s  to  node  t  includes  one  and  only  one 
of  either  Z°  or  /],  j  =  1, . . . ,  m.  Given  P2  define  T  =  {j  :  l)  G  P2},  H  —  {j  :  Z°  G  P2}. 
Notice  that  T  U  H  =  {l,...  ,m},  and  T  n  H  =  0.  With  these  definitions  we  have 


p{PuPz)  =  n^+n^n^-n^n^ 

i=i  «er  i=i  >er 


ffh,fi  +  n(M-n«.V 


(2.6) 


Take  now  £,  =  (/i,)2.  We  then  have 


<er  ter 


Defining  now  b  —  fl™  i  hx  and  f(x)  =  x  —  x2  we  have 


^(p1,pa)=6(i+/in^ 


Maximizing  P(Pi,  P2)  is  equivalent  to  finding  T  C  that  maximizes 

/(llier^i)-  f{x)  is  a  continuous,  concave  function,  its  maximum  is  located  at 
x  —  1/2,  with  /( 1/2)  =  1/4.  Therefore  maximizing  P(PlxP2)  is  equivalent  to 
finding  T  such  that  |  n,er  K  -  l/2|  is  minimized. 

Given  S  —  {ci,...,cm},  where  the  cx  are  positive  integers  and  c,  =  M. 
Define  /i*  =  (l/2)2c,/w.  With  these  definitions  we  can  see  that  there  is  T  such  that 


ID: 


1  \  M  StgT  C<  1 


if  and  only  if  T  is  a  partition  of  5.  Or  equivalently,  if  and  only  if  for  the  corre¬ 
sponding  P2  we  have 


(2.10) 


where  the  last  equality  is  a  definition. 


The  /i*’s  are  not  necessarily  rational  numbers,  and  therefore  thev  cannot  be 


II i  w wm P'« i.'H  »■  ■  ■  ** ■■  V\"T  ■!■■ 


I 
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taken  as  input  data  for  the  complementary  path  problem.  We  next  give  a  sequence 
of  lemmas  by  which  we  prove  that  using  a  suitably  defined  rational  approximation 
of  the  h\ ’s,  which  we  denote  as  h,’s,  we  obtain  a  reduction  to  Partition.  The 
gist  of  the  argument  is  a s  follows.  Given  any  7\,  T2  C  {l,...,m}  we  first  prove 
that  either  n.er,  K  ~  FI ,er3  K  or  !  FFeTi  ^  ~  n,er2  K\  >  A.  We  then  prove  that 
if  we  have  /i,’s  that  are  ‘close  enough’  to  the  h"' s  we  shall  have  that  for  any  T 
n.er  K  ~  n.er^tl  <  A/4.  Thus,  given  the  value  of  flier  K  it  is  possible  to 
determine  the  value  of  flier  From  which,  if  we  have  fllg;r  h,  —  1/2|  <  A/4  we 
can  conc'ude  Flier  K  =  1/2.  Hence,  H,er  c«  —  (1/2)A/.  Then  we  prove  that  it  is 
possible  to  find  the  ht's  in  polynomial  time. 

Lemma  2.1  If  for  any  T  C  {l,...,m}  we  have  :  fll6r /i*  ~  Flier^!  <  then 
F/ier  c<  =  o  A/  :/  and  only  if  there  exists  P2  suc/t  t/iaf 

-  n**  (‘  - 1  (n  *.))  >  ft'-.  (*  -  dsO  -  **•  <2-n> 

m  which  the  last  equality  is  a  definition. 

Proof.  Define  g(i)  -  ( 1  '2)‘,  ‘vr,  i  =  1, . . . ,  M.  Notice  that  for  any  T  C  {l, . . . ,  m} 
there  is  an  integer  t.  1  <  i  <  m  such  that  ff,ej- /i'  —  9(0-  Therefore  given  Tf, T2  C 

{1 . m}.  such  that  h't  #  Flier^  there  exist  integers  1  <  i,j  <  M,i  ^  j 

such  that 


n  k  n  k 


9(0  -  9(9)  >  9(*'/  -  1)  -  g(M) 


1,  a  .  1,  2  In  2  „ 

>  -  2«  -  1  >  -  1  +  — —  +  R2  -  lj 

-  41  1  -  41  M  1 


In  2  1  _  A 

>  2M  >  4M  ~  ’ 


(2.12) 


in  which  the  last  equality  is  a  definition.  We  have  used  a  Taylor’s  series  expansion 
for  2Z,  in  which  >  0  is  the  remainder.  We  have  proved  that,  given  Ti,T*  C 
if  Flier,  5s  Flier,  hi  then 


i  n  k  -  n  vi  >  a  (2.i3) 

ieTi  ier3 


If  we  are  given  rational  approximations  /i,  of  the  /i*’s  (i  =  1, . . .  ,  m)  such  that 
for  any  T  C  {l, . . . ,  m}  it  is  guaranteed  1  fl.er  K  ~  II, er  /i,-|  <  A/4  =  1/16X1.  Then 
n,er  K  —  g(j)  implies  that  the  corresponding  product  of  the  ht's  over  the  same 
set  T  is  going  to  be  closer  to  g{j )  than  to  any  other  g(k)  for  k  ^  j .  Hence,  for  any 
T  C  {1, . . .  ,  m},  if  j  minimizes  |  IlieT  hi  —  g{k)  \  over  k  =  1, . . . ,  XI  we  can  conclude 
that  n<6T  h;  -  g{j)-  Consequently,  if  there  is  T  such  that  Il«er  —  1  / 2 j  <  1/16A/. 
we  can  infer  that  n,er  K  =  1/2,  which  implies  that  c,  =  1/2  and  viceversa. 
Since  f(x)  =  x  —  i2  is  a  concave  function  with  maximum  at  1/2  th  Jbove  is  also 
equivalent  to 


(2.14) 


and  thus  equivalent  to  the  existence  of  a  path  P2  such  that 


where  the  last  equation  is  a  definition. 


I 


Lemma  2.2  If  \h*  —  h<\  <  3  m /16M ,  for  i  =  l, . . .  ,m,  then 

IK -IT*.  j  <  1/16 M  (2.16) 

ieT  ieT 


for  any  T  C  {1, . . . ,  m}. 


Proof.  We  first  proof  an  auxiliary  lemma. 


Lemma  2.3  If  j/i’  -  ht\  <  e,  for  i  =  l,...,m,  where  0  <  <  1.  then  we 

have 

i  n  -  n  m  <  3|r,e.  (2.i7) 

<er  ieT 

Proof.  We  prove  this  by  induction  in  jT|.  For  jT|  =  1  this  statement  corresponds 
to  the  hypothesis.  Suppose  now  that  equation  (2.17)  holds  true  for  any  T  such  that 
jT1,  =  n.  Consider  now  a  set  Q  C  (1, . . .  ,m}  such  that  jQ|  =  n  1.  Take,  without 
loss  of  generality,  that  Q  =  {l,2,...,n  +  l}.  Let  T  =  { 1 , . . . ,  n}.Then  we  have 
n.eT  h  =  liter  K  +  Sn,  for  some  Sn  such  that  |<5n|  <  also  hnJrX  =  h'n^.1  +  6,  for 
some  <5  such  that  | <5  j  <  e.  Therefore 


.  ... 


From  which  have 


n  k  -  n  *. 

<6i 5  i eQ 


\6  n  K  +  6nh’n+l  -r  66n\ 

<'6T 


<  \6  n  A,-!  +  i^/C+ii  m  \86n\ 

<6  T 

<  ]«5|  +  \6n\  +  !<5„|  <  £  +  3n£  +  3n£  <  3"+1e.  (2.19) 


■ 


We  now  require  that  for  any  T  we  have  |  Fl,er  K  —  n.er  ^«|  <  A/4  =  1/16 A/.  W'e 
have  from  the  previous  lemma  that  to  obtain  ]  n.er  K  ~  Flier  ^<1  <  3lT'e  <  3me 
it  is  enough  to  ask  for  \h‘  -  hi\  <  e,  for  i  =  l,...,m  with  3me  <  1/16 M,  i.e. 
£  <  3~m/16A/.  I.e.  if  we  allow  an  error  of  £  <  3_T71/16A/  in  the  approximation 
of  the  hi*  by  the  h,,  we  will  obtain  that  the  n.er  are  within  a  tolerance  of 
A/4  =  1/16  M. 


I 

It  rests  to  show  that  the  /i,  can  be  calculated  in  polynomial  time,  and  that  we 
will  obtain  h,  whose  binary  representation  will  have  length  polynomially  related  to 
the  length  of  the  binary  representation  of  the  particular  instance  of  the  partition 
problem  reduced.  First  we  need  a  new  definition.  Given  a  =  a' /a" ,  where  a  is 
a  rational,  and  both  a',  a"  are  integers,  a"  ^  0,  we  say  that  the  height  of  a  is 


Lemma  2.4  Given  ci,  C2, . . . ,  cm,  positive  integers  such  that  2Jl=1  c,  =  XL ,  there 
exists  an  algorithm  that  finds  rationals  hi,  for  1  =  such  that  ht  — 

(1  2)2c,/'W  <  3_m/16 XI  in  time  polynomial  in  m  and  log  XI.  Xloreover  the  height 
of  the  h,  's  has  a  binary  encoding  of  length  polynomial  in  m  and  log  XI. 


Proof.  We  want  to  calculate  (1/2)1  where  0  <  x  —  2c, /XI  <  2.  We  have 
(1/2)*  =  1  / ex ln 2 .  Notice  also  that  if  xin2  >  1  we  have  exln2  =  eexln2_1,  with 
0<xln2-l<l.  Therefore  to  approximate  (l/2)x,0  <  x  <  2  within  a  precision 
of  e  =  3-m/l6A/  it  is  sufficient  to  be  able  to  calculate  ex,0  <  x  <  1  with  precision 
O(e),  and  to  be  able  to  calculate  ln2  within  0(e).  For  the  calculation  of  ln2  and 
of  ex,0  <  x  <  1  we  have  the  following  lemmas. 


Lemma  2.5  Let  x  be  a  rational  0  <  x  <  1  such  that  the  height  of  x  has  a  binary 
encoding  of  d  bits.  There  exists  an  algorithm  that  finds  a  rational  q  such  that 
ez  -  q |  <  e  in  time  that  is  polynomial  in  d  and  loge-1. 


Proof.  Using  a  Taylor’s  series  expansion  for  e1  in  the  interval  xG  0,1  we  get 


x 

1  +  2! 


(n  —  1)! 


(2.20) 


where  Rn  =  xneix/n\, 0  <  9  <  1.  It  is  plain  that  since  e  <  3  we  have  Rn  <  3 jn\. 
Choose  a  minimal  n  such  that  3/n!  <  e,  or  equivalently  log  n.!  >  loge^1  -1-  log  3. 
Therefore  logn!  =  0(loge_1).  From  Stirling’s  formula  we  have  that  Inn!  ss  nlnn. 
which  implies  n  —  O(nlnn)  =  0(loge_1).  Now,  define 


this  is  a  rational  approximation  of  ez.  The  height  of  each  of  the  terms  in  T, 


/ (A:  —  1) !  for  k  =  1, . . . ,  n  —  1,  has  a  binary  representation  of  0((n  —  \)d  + 
log(n  —  1)!)  =  0((log  e"1^  +  log  e_1)  =  0((log  e_1)d).  T  has  0(loge-1)  terms. 
Therefore  the  height  of  T  has  a  binary  encoding  of  0((log  e_1)2d).  Define  now 
a0  =  l,at+i  =  xa.i/i,i  =  1, ...  ,n  —  2,  then  T  =  ^7=1  from  which  we  conclude 
that  T  can  be  calculated  with  0(log  £-1)  additions  and  the  same  number  of  multi¬ 
plications.  Each  of  these  involve  rationals  whose  heights  have  binary  encodings  of 
lenght  0((log  e_1)d).  Take  q  =  T. 


Lemma  2.6  [Chandrasekaran  &  Tamir]  There  exists  an  algorithm  that  finds  a 
rational  q  such  that  ]  In  2  —  q\  <  e,  for  0  <  e  <  1,  in  time  polynomial  in  loge-1. 
Moreover  the  height  of  q  has  a  binary  encoding  of  lenght  polynomial  in  loge-1. 


Proof.  We  have  that 


lnx  =  ^2 


2  /*-  1 
2 k  +  a  \i  +  1 


(2.22) 


where 


_  (x  -h  l)2  /i-lV'1,3 

0  <  R  <  — - -  (  — —  )  ,  for  x  >  1 

2x(2x  +  3)  \x  +  1/ 


(2.23) 


Using  equation  (2.23)  with  x  =  2  we  get  R  <  3  (2"+3))  we  choose  a  minimal  n  such 


that  3  (2n+3)  <  c,  i.e.  n  =  0(log£  1).  In 2  is  now  approximated  by 


‘  ? 

=  y  — — : 


-(2*+l) 


(2.24) 


in  O (log  t  x)  time,  the  height  of  q  has  an  encoding  of  length  0((log  e'1)2)  bits. 
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Notice  that  we  have  also  proved  that  the  following  problem  is  iVP-complete. 
Rational  Multiplicative  Partition. 

INSTANCE:  Given  qu  q^, ...,  qm,  rational  numbers  such  that  q{  >  1/2,  t  = 
QUESTION:  Is  there  T  C  such  that  [Jt  =  1/2? 


2.4  Complementary  Path  Problem.  Logarithmic 
Input. 

In  this  section  we  consider  the  complementary  path  problem  when  the  link  prob¬ 
abilities  are  given  by  pi  =  an',  for  a  rational  0  <  a  <  1,  and  a  positive  integer 
nj.  The  input  of  the  problem  consists  of  the  rational  number  a,  and  the  integers 
ni,l  6  L.  We  say  that  this  is  a  logarithmic  representation  of  the  p(.  We  give  a 
similar  result  to  that  of  theorem  2.1;that  is  we  prove  that  this  problem  is  iVP-hard, 
the  proof  is  very  similar  to  that  of  theorem  2.1. 


The  complementary  path  problem  with  logarithmic  inputs  is  not  known  to  be 
in  NP.  That  is,  for  a  rational  a;  integers  n\,  n 2;  and  a  rational  pt/,;  it  is  not  known 
if  it  can  be  decided  in  polynomial  time  whether 

a"1  +  a"2  -  a"l+n2  >  pth. 

The  exponentiation  operation,  an,  can  be  realized  in  O(logn)  multiplications,  but 
the  length  of  the  binary  representation  of  the  result  is  not  a  polynomial  in  the 
length  of  the  input.  It  can  be  argued  though  that  in  comparison  with  the  stan¬ 
dard  method  of  representing  the  probabilities,  i.e.  as  a  ratio  of  two  integers,  the 
logarithmic  representation  contains  ‘more  information’,  in  the  following  sense.  If 
we  are  interested  in  representing  probabilities  p  in  the  range  0  <  e  <  p  <  1,  using 
the  standard  representation  we  will  need  0(log  e-1)  bits  to  encode  the  p’s,  on  the 
other  hand,  using  logarithmic  representation,  assuming  that  this  is  acceptable,  we 
need  0(log  log  e-1)  bits.  Consequently,  with  logarithmic  representation,  p  =  an, 
for  some  n,  can  be  calculated  in  0(Iog  e_1). 

.  In  [2j  it  is  shown  that  there  exists  a  polynomial  algorithm  for  deciding  whether 
ai‘  >  aT  1  f°r  rationals  <21,02  and  integers  n\,n2-  The  techniques  used  there  do  not 
seem  to  be  useful  for  the  problem  we  consider  here. 

Theorem  2.2  The  complementary  path  problem  with  logarithmic  inputs  is  NP- 
hard. 

Proof.  Consider  the  same  situation  as  in  theorem  2.1,  with  the  same  definitions 
as  made  there.  Consider  an  instance  of  partition  as  above.  Define  a*  =  (l/2)2/iVf, 


and 


n, o  =  Cj,  and  nt  1  =  2 Cj.  With  these  definitions  we  have  h]  =  a'  t’  =  a'  \ 
therefore  t*  =  /i*2.  As  before  there  is  T  such  that 


(2.25) 


if  and  only  if  £,6T  c,  =  (1/2 )M.  a*  is  not  necessarily  a  rational  number,  there¬ 
fore  it  cannot  be  used  as  part  of  the  input  for  the  complementary  path  problem 
with  logarithmic  representation.  We  next  prove  that  a  suitably  defined  rational 
approximation  of  a*  results  in  a  reduction  to  Partition. 


Lemma  2.7  There  exists  an  algorithm  that  finds  a  rational  a  such  that  la*fc  — o*|  < 
(1/16 M)  =  A/4,  for  any  k  —  1, ...  ,M  in  time  polynomial  in  logM.  Moreover  the 
height  of  a  has  a  binary  encoding  of  length  polynomial  in  log  M. 


Proof.  Consider  the  function  g(x)  =  x*,0  <  x  <  1.  By  a  Taylor’s  series  expansion 
we  have 


g{x0  +  Ax)  =  y(x0)  +  Axp'(x0  +  0Ax),  0  <  6  <  1.  (2.26) 


Therefore 


\g(x0  +  Ax)  -  g,(x0)|  =  |Axj|c/(x0  +  0Ax)|  =  |Ax/A:(xo  +  BAx)k  1 

<  Jfc|Ax|  <  M|Ax|.  (2.27) 

Where  we  have  taken  that  Ax  is  small  enough  to  guarantee  that  0  <  x0  +  Ax  <  1. 


Therefore  to  guarantee  a**  —  or  \  =  g(a’)  —  g{a)\  <  1/16 M  it  is  sufficient  to 
require  M\a'  —  a|  <  1/16 Af,  or  equivalently  a’  —  a|  <  1/16M2.  I.e.  we  need  to 
calculate  a *  =  ( 1  /2) 2/Ar  within  an  accuracy  of  1/16 M2.  The  proof  is  completed  by 
an  argument  similar  to  that  of  lemma  2.4. 
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G  iven  any  T  C  { 1 , . . . ,  m}  we  have  that 

\a*^'i€TCi  _  a^2,eTc>  j  <  —m 

Therefore  | a^^TCi  —  l/2|  <  A/4  if  and  only  if  £ier  c,  =  1/2.  Which  is  equivalent 
to  the  existence  of  a  path  P2  such  that 


—  Pth • 
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2.5  An  Equivalent  Problem 


In  this  section  we  are  going  to  show  that  the  complementary  path  problem  is  a 
special  case  of  a  more  general  problem  that  has  a  ‘cleaner’  structure.  The  algorithm 
to  be  presented  in  section  2.6  can  be  used  to  solve  this  more  general  problem.  In 


the  rest  of  this  chapter  we  shall  assume  that  we  have  pi  —  an'  where  0  <  a  <  1  is 
a  rational  and  nt  is  a  positive  integer,  i.e.  logarithmic  input. 

Deleting  the  first  constant  term  in  the  expression  for  P[P\,P2),  and  factorizing 
UiePiPi  , equation  (2.4),  we  get. 


R1  =  max  pi 

fi-  n 

A 

p 2 

l.  iePi\P2 

J 

( 

(  \ 

=  max  II  Pi 

1  -  n  pi 

n 

p 1  ISP, 

y  lePi 

[l*PiC\Pi  ) 

(2.28) 


Define  now 


' 

Pi,  if  l  £  Pi ; 

1,  otherwise, 


0  =  n  ?<• 

l€Pi 


(2.29) 


We  have  then 


R'  =  max  JJ  pt 

Pi  iePi 


'‘-'(a*)" 


(2.30) 


The  logarithm  is  a  monotonically  increasing  function,  therefore 


=  -In  R'  = 


^  (—  lnp()  —  In  1-3  J]  qt 


r 


mm 


mm 

Pi 


/ 

( 

f  (  \~1) 

\\ 

<  5Z(-  In  Pi)  -  In 
16P2 

1  —  0  exp 

rla*)  J 

Ji 

—  min 

Pi 


2Z  (~ln Pi)  -  In 
ieP2 


[ 

1  —  0  exp 

V 


E(-ln9i) 

iePi 


(2.31) 


We  assume  that  for  l  £  L,  pi  =  an‘\  with  0  <  a  <  1,  nj  a  non-negative  integer. 
Defining  now 


a  —  —In  a, 


di  =  —  In  pi  =  nia, 


wi  =  -Ingi, 
f(x)  =  —  ln(l  —  0ez). 


we  get  the  following  equivalent  problem.  Find  P2  achieving 


( 


r  =  mPin  \  Y,  d‘ + /  m  w‘ 

2  l  l&Pi  \l€Pi  , 


Note  that  /  is  an  increasing  function.  For  any  P2, 


/  =  -m*  -  n  p<)>  o. 

V£Pi  J  iePi\Pi 


(2.32) 


(2.33) 


(2.34) 


2.6  The  Algorithm 


Here  we  address  the  following  problem. 

Generalized  Complementary  Path  Problem. 

INSTANCE:  Given  G  =  (N,L)  a  directed  graph  as  in  section  2.2,  given  for  each 

link  /  two  integers  di  >  0  and  wi  >  0.  Given  /,  a  monotonically  increasing  positive 
real  function. 

TASK:  Find  P,  a  simple  directed  path  in  the  network  connecting  nodes  s  and  t, 
such  that 

J2d‘  +  f 

iep  \ieP  J 

is  minimized. 

With  the  definitions  given  in  equations  (2.29)  and  (2.32)  this  is  equivalent  to  the 
original  problem  as  defined  in  section  2.2.  In  this  section  we  derive  an  algorithm 
for  this  more  general  formulation. 

Given  Pi,  P2 ,  two  s-t  paths  in  G  we  say  that  Pi  dominates  P2  if  Hiep,  di  < 
JZi eP2  di  and  SiePi  wi  —  ^lePi  wi  and  af  ieast  one  of  these  two  inequalities  is  strict. 
An  s-i-path  Pi  is  said  to  be  non-dominated  if  there  is  no  other  path  that  dominates 
it.  We  now  have 

Lemma  2.8  The  minimum  of  g(P)  =  Y,iep  di  +  f{YliePw‘)  occurs  at  a  non- 
dominated  path. 

Proof.  Define  dP  —  ^i^p  d\,  wP  =  ^ZieP  wi.  g[P)  =  di  +  f{wi )  is  a  monotonously 


increasing  function  of  both  dP  and  wP.  Suppose  that  P *  minimizes  g(P)  and  that 
P”  dominates  P\  then  either  dP ••  <  dP .  or  wP-  <  wP *,  this  implies  g{P")  < 
g{P"),  which  is  a  contradiction. 

I 

Let  P 1  denote  a  path  from  node  s  to  node  j.  Define  c;(D)  as  the  minimum 
over  all  P 1  of  '£.ieP>  wi  subject  to  the  condition  that  YiiePi  dj  —  D.  If  for  a  given  D 
there  is  no  s-j  path  with  Yliepi  di  <  D  we  say  c;(D)  =  +oc. 

It  is  easy  to  see  that  Cj{D)  is  a  decreasing  function  of  D.  Also  if  ct(D )  <  ct(D- 1) 
we  have  that  there  exists  P‘  a  non-dominated  s-t  path  with  Y,ieP‘  wi  ~  ct(D)  and 
eP«  di  =  D.  Also  every  pair  ( w,d )  corresponding  to  a  non-dominated  path  shows 
as  w  —  ct(d)  <  ct(d  -  l). 

We  can  now  set  up  the  following  relations 

c,(0)  =  0, 

Cj(D)  =  -hoo  for  j  /  s  and  D  <  0, 

Cj{D)  ~  -  l),min{c*;(D  -  d{kJ))  -r  »(*,/)}}  (2.35) 

For  a  given  D,  assuming  that  c3(d)  is  known  for  x  <  D,  it  takes  0{\L\)  operations 
to  calculate  Cj(D).  We  can  start  with  D  —  1  and  proceed  until  D  —  M,  where  as  XI 
we  can  use  an  upper  bound  for  the  length,  as  measured  by  ]T)iep  d; ,  a  simple  s-t  path 


in  G  can  have.  I.e.  M  >  Hiep  di ,  for  any  simple  path  P  between  nodes  s  and  t.  For 
example,  A /  =  V](ei  d{  is  such  a  bound.  Given  that  a  simple  path  in  the  network 
can  have  no  more  than  j N)  —  1  links,  if  we  suppose,  without  loss  of  generality, 
di  >  d2  >  ■  •  •  >  dn,  then  M  —  di  is  a  tighter  bound.  M  =  (| N\  —  ljdj  can 

also  serve  as  a  bound.  If  after  deleting  all  links  l  such  that  W[  >  0,  nodes  s  and  t 
remain  connected  in  the  resulting  network  G\  then  we  can  use  as  \I  the  lenght  of 
a  ^-shortest  s-t  path  in  G1 . 

This  results  in  an  0{M\L\)  calculation.  The  presence  of  M  here  classifies  this 
algorithm  as  pseudo-polynomial.  To  find  the  optimum  of  g{P)  it  is  sufficient  to 
identify  all  pairs  (w,d)  corresponding  to  non-dominated  paths,  of  which  there  will 
be  at  most  \I,  and  identify  one  that  achieves  the  minimum,  an  operation  that  takes 
0(iV/)  steps.  Notice  that  since  the  minimum  of  g{P)  coincides  with  the  minimum 
of  P(Pi,P2)  in  the  complementary  path  problem  we  can  compare  the  different 
non-dominated  points  by  evaluating  P(P\,Pi)  for  these  points. 


Chapter  3 


Routing  Two  Messages  Along 
Disjoint  Paths 

3.1  Introduction 

In  this  chapter  we  address  the  following  problem.  We  wish  to  transmit  a  message 
between  two  given  nodes,  nodes  s  and  t,  of  an  unreliable  data-communication 
network.  To  increase  the  probability  that  the  message  is  received  we  will  send 
two  copies  of  the  message.  In  addition  we  include  the  restriction  that  the  paths 
followed  by  the  two  copies  should  be  link-disjoint,  i.e.  they  should  share  no  links. 
In  contrast  to  the  problem  considered  in  chapter  two  neither  of  the  two  paths  is 
fixed.  How  should  these  two  copies  be  routed  so  as  to  maximize  the  probability 
that  at  least  one  of  them  is  received  at  node  tl 

In  section  3.2  of  this  chapter  the  problem  is  precisely  stated  and  the  solution 


points  are  characterized.  In  section  3.3  we  give  a  solution  for  the  particular  case 
of  an  acyclic  network.  In  section  3.4  we  prove  that  this  problem  is  .VP-complete 
for  an  unrestricted  network,  i.e.  a  not  necessarily  acyclic  network.  For  logarithmic 
inputs  the  problem  is  shown  to  be  strongly  A’P-complete.  In  the  next  chapter  we 
address  the  same  problem  without  the  disjointedness  restriction. 


3.2  Disjoint  Paths 


We  now  consider  the  following  problem. 


Directed  two  link-disjoint  path  maximum  reliability  problem. 

INSTANCE:  Given  a  network  G  =  ( N,L ),  where  N  is  the  set  of  nodes  of  the 

network,  and  L  C  N  x  N  is  the  set  of  links.  Given  for  each  link  /  a  number  pi, 
0  <  pi  <  1.  Where  this  pi  is  the  probability  that  link  /  is  operative.  For  every  link 
l  we  have  pi  =  a"',  where  nt  is  a  non-negative  integer,  and  0  <  a  <  1. 

TASK:  Find  two  simple  link-disjoint  paths  Px  and  Pi,  both  joining  node  s  to 

node  t,  such  that  the  probability  that  at  least  one  of  the  two  paths  is  operative  is 
maximized.  A  path  is  operative  if  every  link  in  the  path  is  operative. 

Given  two  nodes  t,  j,  if  the  link  (i,j)  L ,  then  we  shall  take  nit})  =  oo.  We 
want  to  find  paths  P2  and  P2  such  that  ?{Pi,P2),  the  probability  that  at  least  one 
of  these  two  paths  is  operative,  is  maximized. 


p{PuP2)  =  n»+np*-n»np* 


(3.36) 


—  a 


V^ier,  1  £*>ier2 


(3.37) 


».  I  J I  I  ■  U  1 1 


Define  now 


9{rur2)  =  ar‘  ~  a1"2  -  a  ‘~T'-. 


(3.38) 


Then,  we  have 


P[Pi,Pz)  =  9 


\Yni'Y  n‘ 

\ier  i  Jt/’i  ./ 


(3.39) 


Theorem  3.1,  which  follows,  allows  us  to  characterize  an  optimum  pair  of  paths 
Pi ,  P^.  With  this  characterization  we  shall  later  be  able  to  produce  an  algorithm 
to  find  an  optimum  pair  of  disjoint  paths  for  an  acyclic  network. 


Theorem  3.1  Given  R,  a  finite  subset  of  points  (ri,r2)  belonging  to  the  positive 
orthant  of  R2 ,  where  by  R  we  denote  the  real  line.  A  point  (rj,r j)  £  R,  maximizing 
the  function 


ff(ri,r2)  =  ar‘  +  <T2  -  ar‘+r2  over  R  (3.40) 

is  an  extreme  point  of  the  convex  hull  of  Rd ,  By  R  we  mean  the  convex  hull  of  R, 
and  by  Rd  the  union  of  R  with  all  points  dominated  by  some  point  of  R.  .4  point 
(rj,rj)  dominates  a  point  (rl5r2)  if  and  only  if  rj  <  ri  and  rj  <  r2  and  at  least  one 
of  the  two  inequalities  is  strict.  I.e. 

Rd  =  R  U  {x  :  3r  £  Rsuch  that  r  dominates  s}  (3-41) 


\  % 


T 


Notice  that  R  C  R  C  Rd l  and  an  extreme  point  of  Rd  is  a  non-dominated  point, 
i.e.  it  is  not  dominated  by  any  other  point  m  Rd . 

* 

t 

|  We  shall  first  prove  three  auxiliary  lemmata,  and  then  give  the  proof  for  theorem 


3.1. 


L 


Lemma  3.1 


g(r  r?)  is  a  decreasing  function  of  both  of  its  arguments. 


Proof.  g{r\,r 2)  =  ari(l  —  ar-)  -1-  ar-,  for  fixed  r2  =  c  >  0  we  have 


f{r i)  =  g{ri,c)  =  (1  -  ac)ar‘  -y  ac 


(3.42) 


and 


df 

- —  =  (1  -  ac)ar‘  In  a  <  0. 

ar  i 


(3.43) 


I 

Lemma  3.2  A  point  (rj,rj)  maximizing  the  function  g  over  R  is  non-dominated. 

Proof.  Suppose  that  there  exists  (r^r^)  G  R  that  dominates  (rj,rj).  Then, 
either  ri  <  r(,  or  r2  <  rj  from  which,  from  lemma  3.1,  we  get  ff(rj,rj)  <  ff(ri,r2), 
this  is  a  contradiction. 


I 


Itl 


al 


Cl 


Figure  3.1:  Boundary  of  R 

Lemma  3.3  Given  points  (<ix , <i2 )  and  (ei,e2),  such  that  dx  <  ei  and  e2  <  d2,  the 
maximum  of  g  in  T,  the  convex  hull  of  (d1,d2),  (ei,e2)  and  {ei,d2)  is  attained  at 
(di,d2)  or  (ex,e2),  see  figure  3.1. 

Proof.  The  region  T  is  defined  by,  r i  <  eL,  r2  <  d2  and  r2  >  — br i  +  c,  where 
b  =  (d2  —  e2)/(ei  —  a'j)  >  0  and  c  =  ex6  +  e2  >  0.  By  lemma  3.2  the  maximum  of  g 
in  T  will  occur  at  some  point  of  L,  the  straight  line  connecting  points  (di,d2)  and 
(ei,e2),  every  other  point  in  T  is  dominated  by  some  point  in  L.  Now  we  need  only 
show  that  the  maximum  of  g  in  L  is  attained  at  {d\,d2)  or  (ei,e2). 


Define  p\  —  ari ,  p2  =  o’"2.  This  change  of  variables  maps  the  positive  orthant 


'2 


Figure  3.2: 


of  R2  to  the  unit  square  0  <  p\  <  1,  0  <  p2  <  1.  T',  the  image  of  T  under  this 
transformation  is  defined  by  pi  >  a'1,  p2  >  ad 2 ,  and  p2  <  acp^b,  see  figure  3.2. 

Evaluating  5f(r1,r2)  is  equivalent  to  evaluating  f{pi,P2)  =  Pi  +  Pi  —  PiPi ■  The 
maximum  of  /  in  T'  is  in  some  point  of  V ,  the  image  of  L.  Consider  now  J,  the 
straight  line  that  joins  ( adl,ad 3)  to  (a®1, a'2).  For  any  point  (pi,P2)  6  L',  there  is  a 
point  (p'x,p2)  G  J  such  that  f{p\,p'2)  >  /(pi,P2)-  We  have 


J  =  {(pi,P2)  :  pi  ~  —  mpi  +  n,  a®1  <  pi  <  a1*1;  for  some  m,n  >  0}.  (3.44) 


The  function  g  evaluated  in  J  is 


g(pi,  —mpi  +  n)  =  mp\  -f  (1  —  m  —  n)pt  +  n,  (3.45) 

this  is  a  convex  function  of  px.  This  implies  that  the  maximum  of  g  in  J  is  attained 
at  either  p  —  a‘l  or  at  p  =  adl,  i.e.  at  either  ( aei,a‘ 3)  or  at  ( adl,ad 2).  These  points 
belong  to  L',  this  implies  that  one  of  them  maximizes  g  in  V  too.  Therefore,  /  in 
T  is  maximized  by  either  (ei,e2)  or  (di,d2). 


I 

Lemma  3.4  A  non- dominated  point  of  R,  (rj,r2),  is  either  an  extreme  point  of 
Rd ,  or  there  are  two  extreme  points  of  Rd ,  nominally,  (ej,e2),  (di,d2),  such  that 
di  <  ri  <  t\,  and  e2  <  r2  <  d2. 


Proof.  Comes  directly  from  the  definitions. 


I 

Proof  of  Theorem  3.1:  Consider  (ri,r2)  G  R ,  suppose  (ri,r2)  is  not  dominated 
by  any  point  of  R ,  and  (ri,r2)  is  not  an  extreme  point  of  Rd,  see  figure  3.3.  Then, 
by  lemma  3.4,  there  exist  points  (ei,e2),  (dL,d2)  as  in  lemma  3.3,  extreme  points  of 
Rd  such  that  (fi,r2)  £  T,  as  defined  in  the  lemma.  Then,  either  (ei,e2)  or  (dl5d2) 
achieve  a  larger  value  of  g  than  that  achieved  by  (ri,r2). 
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Thus,  one  way  to  compute  the  optimum  of  P[P\,  Pi)  is  to  generate  the  set  of  all 
extreme  points  of  Rd,  and  then  find  which  of  them  maximizes  g.  If  the  number  of 
these  points  is  not  excessive  this  approach  is  practical.  In  the  rest  of  this  section  we 
whall  give  an  upper  bound  to  the  number  of  extreme  points  of  Rd .  As  R  we  take 
the  set  of  all  pairs  (rl5r2),  where  r,  =  Eiep,  ni> 1  =  1,2,  and  Pi,  P2  are  node-disjoint 
paths  that  connect  nodes  s  and  t  in  the  network  G. 

3.2.1  A  Bound  on  the  Number  of  Non-Dominated  Points 
of  Rd 

Observe  from  figure  3.1  that  R  is  symmetrical,  i.e.  if  (r!,r2)  is  in  R,  then  (r2,ri) 
also  belongs  to  R.  Let  r*  be  the  length  of  a  shortest  path  in  the  network,  a  path 
connecting  nodes  s  and  t,  where  the  length  of  link  l  is  given  by  n;.  Thus,  ar‘  is  the 
probability  that  a  most  reliable  path  in  the  network  is  operative.  Pick  a  shortest 
path  of  the  network,  it  need  not  be  unique.  Delete  from  the  network  all  the  links 
belonging  to  this  shortest  path.  Suppose  that  in  the  resulting  network,  G',  the 
nodes  s  and  t  remain  connected.  Calculate  in  G'  r**  the  length  of  a  shortest  path. 
For  any  non-dominated  point  (rl5  r2),  we  have  that  r1,r2  >  r*,  and,  as  (r*,  r")  £  R, 
we  have  rlfr2  <  r**.  Therefore,  the  number  of  non-dominated  points  is  bounded  by 
(r“  -  r*  4-  1),  see  figure  3.4.  Notice  also  that  if  the  point  (r*,r*)  £  R ,  i.e.  if  there 
are  two  disjoint  paths  both  of  shortest  length,  then  this  point  is  a  maximum  of 
the  function  g  on  R.  Notice  also  that  r**,  as  generated  here,  need  not  be  uniquely 
defined,  its  value  might  depend  on  which  shortest  path  was  originally  deleted  from 
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3.3  A  Solution  for  Acyclic  networks 


3.3.1  Generating  the  Extreme  Points  of  Rd.  Stratified  Net¬ 
works 

Given  G,  an  acyclic  network,  the  set  of  nodes  N  can  be  partitioned  into  disjoint 

layers  SUS2, . . . ,  Sm,  where  m  <  n,  and  U,=i . m  S ,■  =  N,  and  such  that  the  layers 

have  the  property  that  for  k  £  N  all  the  links  emanating  from  k  go  to  a  node  in  a 
higher  numbered  layer.  Additionally  we  also  impose  the  condition  that  S i  =  {s}, 
and  Sm  =  {n}.  Suppose,  without  loss  of  generality,  that  for  k  £  5,  all  the  links 
emanating  from  k  go  to  5't+1.  We  say  that  a  network  that  satisfies  these  properties 
is  a  stratified  network.  We  call  the  5,  the  strata  of  the  network.  See  figure  3.5. 

3.3.2  Squared  Network 

Given  a  stratified  network  G  =  (iV,L),  with  strata  Si,...,Sm,  we  form  G2,  the 
squared  network,  in  the  following  way.  We  denote  the  nodes  of  G‘  by  a  pair  of 
integers  e.g.  ij,  G2  consists  of  m  strata  52, . . . ,  Similarly  to  G,  S\  =  {ss},  and 
sm  =  {nn}. 

The  strata  S2  for  1  <  i  <  m  are  created  as  follows.  For  every  ordered  pair  of 
nodes  k,j  £  5,  there  exists  a  node  kj  £  S2.  There  is  an  arc  ( kl,rs ),  connecting 
nodes  kl  in  Sf  to  node  rs  in  5,2+1,  if  and  only  if  there  is  an  arc  connecting  node 
k  £  Si  to  node  r  £  5<+1,  and  an  arc  connecting  node  /  £  5,  to  node  r  €  S,+i,  unless 
k  =  l  and  r  =  s.  This  last  requirement  serves  to  guarantee  disjoint  paths.  See 


r  \ 

22 


Figure  3.6:  The  network  of  the  previous  figure  squared 


figure  3.6. 


A  link  (tk,  rs)  €  G2  is  labeled  by  two  coefficients,  c\ikja)  =  n^i>r)  and  cfik  r3)  = 

n(k,s}- 

To  every  path  connecting  node  ss  to  node  tt  in  G2  correspond  two  disjoint 
paths  in  G,  both  connecting  node  s  to  node  f,  and  viceversa.  The  squared  network 
device  transforms  the  search  for  two  disjoint  paths  in  G  into  a  search  for  a  single 
path  in  G2. 

3.3.3  Characterization  of  Extreme  Points 


To  each  path  from  node  ss  to  node  tt  in  G2  we  can  associate  two  numbers  V\  and 
i'2-  These  numbers  are  the  lengths  of  the  corresponding  disjoint  paths  Px  and  P ’2 


in  G.  For  a  given  path  P  in  G2,  ui  is  obtained  by  adding  the  c^s  of  the  links  in  P, 


v 2  is  obtained  in  analogous  way.  That  is,  v\  =  Y,ieP  ci>  an<^  u2  =  12 ieP  C1  ■ 

Obtaining  a  path  P*  minimizing  one  of  these  objectives  can  be  formulated  as 
a  minimum-cost-flow  problem.  To  each  link  /  £  L2  we  associate  a  positive  number 
fi,  which  we  interpret  as  the  flow  aloi.D  link  l.  These  fi  should  satisfy  the  following 
constraints.  With  the  exception  of  nodes  ss  and  tt  the  flow  in  each  node  of  G2 
should  be  conserved.  The  net  flow  in  node  ss  should  be  outgoing,  more  flow  going 
out  than  entering,  with  a  value  of  one.  The  net  flow  in  node  tt  should  be  entering, 
more  flow  entering  than  going  out,  with  a  value  of  one.  That  is, 


£ 

f{ra,kl) 

-  £ 

f(ki,rs)  =  o,  rs  E  L2,rs  ^  ss,tt. 

(3.46) 

{kl.(rs,kl)eL2} 

{ki-.{ki,T3)eL2} 

£ 

f[33,kl) 

-  £ 

f{kl,33)  1* 

(3.47) 

{kl:{)3,kl)eL2} 

£ 

f(tt.kl) 

-  £ 

/(«.«>  =  _1- 

(3.48) 

{kl:{tt,  kl)eL2}  {kl:{kl,tt)£L2} 


This  is  a  linear  program  with  two  objective  functions,  v[  and  v'2. 


V1  =  51  Cl  hr>M) 

(i rs,kl)eL 2 


V2  =  E  Clf{rsM)  (3-49) 

( r3,kl)eL 2 


Notice  that  the  feasible  region  defined  by  these  constraints  corresponds  to  R,  the 
convex  hull  of  R.  If  this  linear  program  is  expressed  in  matrix  form  the  matrix 
corresponding  to  the  flow-conservation  constraints  is  unimodular.  This  guarantees 


that  for  any  objective  linear  function,  if  there  is  an  optimum  solution,  there  is 
an  integer  optimum  solution.  Which  means  that  we  may  use  the  techniques  and 
results  of  linear  programming  to  find  P* .  Moreover  there  exist  standard  algorithms, 
see  [10],  to  find  the  set  of  all  non-dominated  points  of  a  multiple-objective  linear 
program.  In  their  simplest  form,  the  basic  approach  is  to  form  a  combined  objective 
function  =  v[  +  Auj,  for  A  >  0.  Non-dominated  points  with  respect  to  the 
objectives  v[  and  v'2  correspond  to  extreme  points  of  Rd.  Varying  A  the  different 
non-dominated  points  are  obtained.  We  need  consider  only  as  many  different  values 
of  A  as  there  are  non-dominated  points.  We  are  using  only  the  fact  that  the 
points  of  R  that  optimize  (3.36)  are  non-dominated  points.  To  obtain  a  more 
efficient  approach  we  would  have  to  use  additional  properties  of  the  function  we 
are  optimizing.  In  the  next  section  we  will  present  an  algorithm  to  find  two  paths 
optimizing  v\  without  explicitly  constructing  the  squared  network. 

3.3.4  Two  Minimum-Cost  Paths  with  Different  Path  Costs 

We  first  give  an  algorithm  for  a  more  general  problem,  and  then  specialize  it  to 
optimize  Given  G  =  (V,  L)  an  acyclic  network,  in  which  \N\  =  m,  we  can 
take  without  loss  of  generality  that  the  nodes  in  this  network  are  so  ordered  that 
for  any  link  (i,j)  we  have  i  <  j.  This  numbering  of  the  nodes  can  always  be 
done  in  an  acyclic  network,  see  [9j.  Let  then  s,  the  origin  node,  be  node  1;  and  f, 
the  destination  node,  be  node  m.  To  each  link  l  in  the  network  we  associate  two 
non-negative  numbers  c,1  and  cf. 

Define  for  i,  j,  a  pair  of,  not  necessarily  distinct,  nodes,  S(i,j )  as  the  set  of  all 


pairs  [Pi,  Pi)  of  directed  link-disjoint  paths  in  which  Pi  connects  node  s  to  node 
i,  and  P2  connects  nodes  t  to  node  j.  Notice  that  for  any  link  (q,r)  in  Pi  we  have 


q,r  <  i,  and  similarly  for  links  of  P2. 


Define 


d[i,j)  =  min  <  ^  c,1  +  Y)  cj  >  . 


(3.50) 


Our  objective  is  to  find  d(t,t).  We  take  d(s,s)  =  0.  That  is,  we  are  looking  for 
two  minimum  cost  link-disjoint  paths  both  joining  node  s  to  node  t,  for  which  the 
cost  of  a  link  /  belonging  to  the  first  path  is  given  by  cj,  and  the  cost  of  a  link 
l  belonging  to  the  second  path  is  given  by  cj.  If  we  specialize  the  c j  by  taking 
Cj  =  Ac,  ,Cj  =  n,,  we  are  minimizing  vx,  as  defined  in  the  previous  section. 


With  this  definitions  we  can  write 


d(i,/c)  =  min{(i(7,j)  +  cjyjk)J  i  <  k. 


(3.51) 


Similarly 


d(k,  i )  =  min  {  d(j,  i )  4-  c\hk) }  i  <  k. 


(3.52) 


And 


d{k,  k )  =  min  {c(V  fc)  +  c}]  k)  +  d{i,j)} 


(3.53) 


]*• 


This  is  because  any  link  of  the  form  (i,k)  cannot  be  part  of  any  Px,  P2  in 
S(i,j )  for  i,j  <  k.  We  can  calculate  the  d(i,j)  by  starting  with  d(  1, 1)  =  0,  and 


for  increasing  values  of  k  calculating  the  d{i,k ),  d(k,i),  and  d{k,k)  for  i  <  k.  We 
do  this  for  m  —  1  different  values  of  k,  and  each  time  we  take  0(k 2)  operations.  We 
have  an  algorithm  that  uses  0(m3)  steps.  The  optimizing  paths  can  be  obtained 
by  recording  the  values  of  the  indices  that  achieve  the  minimum  in  3.53. 

Take  now  c}  =  nj  and  cf  =  A m,  then  d(s,s)  corresponds  to  the  optimum  for  v\. 
One  could  try  to  generalize  this  approach  to  general  networks,  i.e.  not  necessarily 
acyclic  networks,  but  in  the  next  section  we  prove  that  the  problem  of  finding  two 
minimum-cost  link-disjoint  paths  with  different  path  costs  in  a  general  network 
is  iVP-complete.  Not  only  that,  but  the  directed  two  link-disjoint  path  maximum 
reliability  problem  is  also  iVP-complete,  as  will  be  shown  in  next  section. 


3.4  The  Directed  Two  Path  Maximum  Reliabil¬ 
ity  Problem  is  NP- Complete 

We  address  the  following  problem. 

Directed  two  link-disjoint  path  maximum  reliability  problem.  (Decision 
version.) 

INSTANCE:  Given  a  network  G  =  (N,L),  where  N  is  the  set  of  nodes  of  the 

network,  and  L  C  N  x  N  is  the  set  of  links.  Given  for  each  link  l  a  number  pi, 
0  <  pi  <  1.  Where  this  p;  is  the  probability  that  link  l  is  operative.  For  every  link 
l  we  have  pi  =  an‘,  where  n;  is  a  non-negative  integer,  and  0  <  a  <  1. 


QUESTION:  Are  there  two  link-disjoint  paths  in  the  network  Pi  and  P?,  such  that 


both  these  paths  connect  node  s  to  node  t,  and  the  probability  that  at  least  one  of 
these  paths  is  operative  is  larger  than  a  given  threshold  p*/,? 

Notice  that  the  existence  of  two  link-disjoint  paths  between  s  and  t  can  be 
established  by  a  maximum  flow  computation,  see  [3],  this  is  a  polynomial  operation. 

Given  Pi  and  P2 ,  P{P\,  Pi),  the  probability  that  at  least  one  of  these  two  paths 
is  operative  is  given  by 

p{Pi,P2)  =  n  ^  +  n  ^  ~  n  ^  n  (3-54) 

iePi  ieP2  lePi  ieP2 

If  the  pi  are  rational  numbers,  i.e.  if  a  is  a  rational  number,  then  this  problem  is 
in  NT.  I.e.  given  Pi  and  P2  one  can  decide  in  polynomial  time  if  P(Pi,P2)  >  pth 
for  a  given  pth. 

We  shall  now  prove  that  the  problem  is  NT-hard.  This  will  be  done  by  show¬ 
ing  that  if  we  can  solve  the  two  link-disjoint  path  reliability  problem,  as  defined 
above,  then  we  can  also  solve  another  problem  that  already  been  proved  to  be 
NT-complete.  The  problem  we  are  going  to  use  is  as  follows. 

Directed  two  link-disjoint  path  problem  with  distinct  origins  and  desti¬ 
nations. 

INSTANCE:  Given  a  directed  network  G'  =  ( N',L '),  and  given  si,  s2,  1 1,  t2,  four 

distinguished  nodes  of  this  network. 


QUESTION:  Are  there  in  this  network  two  link-disjoint  paths  P2  and  P2  such  that 
Pi  connects  si  to  tx  and  P2  connects  s2  to  <2? 


Original  Network 


Figure  3.7:  The  original  network  with  added  links 


For  the  NP- completeness  proof  of  this  problem  see  [5 


3.4.1  The  Reduction 


Given  G’  and  Si,  S2,  1 1,  we  augment  the  network  in  the  following  way,  see 
figure  3.7.  Add  two  nodes  s  and  t,  also  add  four  links:  (s,  s i),  (s,S2),  and  (ti,t), 
(ti,t).  Assign  link  probabilities  to  this  augmented  network  as  follows:  P(s,3l)  = 
P(ti,t)  ~  (1  /2)3ri,  and  pt  =  1/2  for  all  other  links,  where  n  is  the  number  of  nodes  in 
the  network.  Notice  that  if  we  use  standard  binary  representation  for  the  pi,  the 
number  of  bits  necessary  to  specify  them  is  0(n2). 


Any  pair  [Pi,  Pi)  of  link-disjoint  paths  joining  s  to  t  will  necessarily  use  the 
four  links  that  were  added  to  the  original  network.  Take,  without  loss  of  generality, 


“Straight  Paths 


“Crossed  Paths” 


Figure  3.8:  ‘Straight’  and  ‘Crossed’  paths 

that  Pi  uses  (s,si),  and  P2  uses,  necessarily,  (s,S2).  In  the  case  that  Pi  uses  (ti,t), 
and  P2  uses  (f2,0>  we  say  that  ( Pi,P2 )  is  a  straight  pair.  Otherwise,  if  Pi  uses 
{t2,t),  and  P2  uses  (ti,t),  we  say  that  (Pi,P2)  is  a  crossed  pair  of  paths,  see  figure 


Lemma  3.5  If  |!V'|  =  n  >  2,  i.e.  if  the  network  has  at  least  two  nodes,  then  for 
any  straight  pair  (Pf,  Pf),  and  any  crossed  pair  with  the  probabilities  as 


assigned  above,  we  have 


P(Pt,Pf)  >  P{Pt,Pc2)- 


(3.55) 


Proof.  We  have 


p(p;,pp  =  iipi+  n  pi  -  n  p*  n » 


i€P’  iep;  iep;  ieP ,* 


>  TT  P‘  >  (l/2)n_1+2  >  (1/2)' 


(3.56) 


(3.57) 


(3.58) 


p{pt,pi)  =  nft+n«-np«nft 

iePt  i&pz  ieP f  ieP“ 


n  pi  +  n  pi  ^  pm + 

(3.59) 

ieP;  ieP 2C 

2(l/2)3n  =  (1/2)3"-1  <  (1/2)2". 

(3.60) 

Therefore, 

(3.61) 

I 

Theorem  3.2  77ie  directed  two  link-disjoint  path  maximum  reliability  problem  is 
NP-complete. 

Proof.  Given  G  and  s,-,  i  =  1,2,  augment  this  network  and  assign  link  prob¬ 
abilities  as  above.  From  Lemma  3.5  we  know  that  there  is  a  pair  of  link-disjoint 
paths  Pi  and  Pi  connecting  s  to  t  with  P{Pi,Pi)  >  pth  —  (1  /2)2",  if  and  only  if 
there  exist  two  link-disjoint  paths  P[  and  P[  so  that  P[  connects  si  to  t\  and  PI 
connects  s-i  to  1 1. 


P(Pt,P‘)>(l/2)2n>  P(Pl,P‘). 


I 


In  the  proof  of  lemma  3.5  all  the  probabilities  have  the  form  pi  =  (l/2)s^n^, 


where  n(  =  gin )  is  a  polynomial  in  n.  From  this  we  have,  see  [6],  following 


Corollary  3.1  The  directed  two  link-disjoint  paths  maximum  reliability  problem  is 
strongly  NP-complete. 

Also,  if  we  allow  pt  —  1,  lemma  3.5  has  an  analogue  with  P{tut)  —  =  1/ 2, 

and  pi  —  1  for  the  rest  of  the  nodes.  If  the  data  are  given  in  the  non-logarithmic 
format,  i.e.  pi  is  a  rational  number,  the  pi  are  0(1),  and  therefore  a  polynomial  of 
n  hence  we  have 

Corollary  3.2  The  directed  two  link-disjoint  path  maximum  reliability  problem 
with  standard  input  format  is  strongly  NP-complete  if  perfectly  reliable  links  are 
allowed. 

It  is  interesting  that  the  existence  of  two  link-disjoint  paths  if  G  is  an  undirected 
network  is  not  an  NP-complete  problem.  Y.  Shiloach  has  given  a  polynomial  algo¬ 
rithm,  see  [14]. 

3.4.2  Related  Problems 

The  intermediate  problem  treated  in  section  3.3.4  can  also  be  shown  to  be  iVP- 
complete  when  the  network  is  not  assumed  to  be  acyclic.  The  proof  is  very  similar 
to  that  given  in  the  previous  section.  The  problem  is  as  follows. 

Directed  two  link-disjoint  weighted  path  problem. 

INSTANCE:  Given  a  directed  network  G  as  in  3.1  Given  for  each  link  l  of  this 

network  a  non-negative  integer  nj.  Given  also  a  rational  number  A  >  0. 


/V 

a; 


QUESTION:  Are  there  two  link-disjoint  paths  Pt  and  P2  such  that 


F{Pi,Pi)  —  ^  ni  +  A  ^  ni  —  fth  (3.62) 

iePi  ieP7 

for  a  given  rational  threshold  value  fth  >  0? 

Notice  that  for  A  =  q/r,  where  q  and  r  are  positive  integers  this  is  equivalent 


to 


F'{Pi,Pt )  =  r  E  +  ?  E  ni  ^  (3.63) 

i  ePi  ieP2 

It  is  easy  to  see  that  this  is  an  NP  problem.  I.e.,  given  two  paths  Pi  and  P2  it  is 
possible  to  decide  in  polynomial  time  if  3.63  holds  or  not. 


To  prove  that  this  is  an  JVP-complete  problem  we  reduce  the  same  problem  used 
in  the  previous  section.  The  proof  is  very  similar.  Given  a  directed  network  G' 
and  four  distinguished  nodes  as  before,  we  augment  the  network  as  in  the  previous 


section.  Assign  link  numbers  in  the  following  way:  n(J|3l)  =  n^tut)  =  lOn,  the  rest 
of  the  links  are  assigned  nj  =  1.  Take  q  =  10  and  r  =  1.  Then  we  have. 


Lemma  3.6  For  any  crossed  pair  of  paths  ( P[ ,  P2C )  so  that  both  these  paths  connect 
s  to  t  we  have 


v; 

VJ 


P'(P(\P2C)  >  50n. 


(3.64) 


Proof.  We  have 


P'(Pi,P2)  =  n<  +  10  H  ni  -  10(10n)  =  100n. 
iep; 


(3.65) 


iePi 
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Lemma  3.7  If  there  is  a  straight  pair  (Pas,P/)  so  that  both  these  paths  connect  s 
to  t,  then  there  is  a  straight  pair  (P*,  Pf)  such  that 

F'{Pf,Pf)  <  50n.  (3.66) 


Proof.  Of  the  paths  P‘  and  P/  one  of  them  must  use  the  link  (s,si),  take  that 
path  as  Pf,  take  the  remaining  one  as  P2S.  Then  we  have 

F'{Pf,  Pf)  =  Y,  +  1°  E  <  2(10n)  +  (n  -  1)  +  10(n  -1  +  2)  (3.67) 

l€P •  16P‘ 

<  30n  <  50 n.  (3.68) 

■ 


Now  we  can  prove 

Theorem  3.3  The  directed  two  link-disjoint  weighted  path  problem  is  strongly  NP- 
complete. 


Proof.  Given  G'  and  its  four  distinguished  nodes,  augment  this  network  and 
assign  the  ni  as  above.  From  the  lemmata  3.6  and  3.7,  we  know  that  there  is  a 


pair  of  link-disjoint  paths  Px  and  P2  connecting  s  to  t  with  F(Pi,  P2)  <  fth  =  50n, 
if  and  only  if  there  exist  two  link-disjoint  paths  P(  and  P2  such  that  P[  connects 
Si  to  1 1  and  P2  connects  s2  to  t2.  All  coefficients  in  the  problem  are  a  polynomial 
in  n,  therefore  the  problem  is  proved  to  be  strongly  NP- complete. 


I 

Notice  that  from  this  we  have  that  the  more  general  problem  of  deciding  if  there 
exist  link-disjoint  paths  P t  and  P2  that  connect  s  to  f  and  such  that  ep,  c,1  + 
Ei ePic1  -  c^>  f°r  giyen  non-negative  integers  cth,  cj,  l  E  L,  i  =  1,2,  is  also  an 
NP-complete  problem. 

Also  it  is  interesting  that  if  in  the  proof  of  section  3.4.  we  change  the  definition 
of  P(Pi,P2)  to  read 

p{Pi,P2)=  Y[pi+Y[pi>  (3-69) 

tePi  ieP2 

the  proof  still  holds.  Of  course,  now  P(Pi,P2)  is  not  a  probability  any  more. 
It  can  be  interpreted  to  be  the  expected  number  of  paths  that  are  operative,  or 
equivalently,  the  expected  number  of  copies  that  arrive  at  node  t.  This  also  would 
have  made  sense  as  an  alternative  routing  criterion.  I.e.  we  could  have  started  by 
trying  to  find  two  paths  P2  and  P2  such  that  if  we  were  to  send  one  copy  of  the 
message  along  each  one  of  them  the  expected  number  of  copies  arriving  at  node  t 
is  maximized.  As  we  just  saw  this  problem  is  also  NP- complete. 
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Chapter  4 

Routing  Two  Messages  Along 
Two  Non-Disjoint  Paths 

4.1  Introduction 

The  problem  we  deal  with  in  this  chapter  is  similar  to  the  one  addressed  in  chapter 
three  the  difference  is  that  here  we  dispense  with  the  requirement  that  the  paths 
followed  by  the  two  copies  of  the  message  be  link-disjoint.  This  gives  us  more 
freedom  in  choosing  these  routes,  offering  thus,  possibly,  a  higher  probability  that 
the  message  be  received.  The  problems  addressed  in  chapters  two  and  three  could 
be  considered  to  be  restrictions  of  this  one. 

In  section  4.2  we  prove  that  this  is  an  iVP-complete  problem  if  the  probabili¬ 
ties  are  given  as  rational  numbers,  and  iVP-hard  if  the  probabilities  are  given  in 
logarithmic  format.  In  section  4.3  we  give  a  pseudo-polynomial  algorithm  for  its 


solution  when  we  have  logarithmic  inputs. 


4.2  The  Two  Non-Disjoint  Path  Maximum  Reli¬ 
ability  Problem  is  JVP- Complete 

The  problem  is  as  follows. 

Two  path  maximum  reliability  problem. 

INSTANCE:  Given  a  network  G  =  ( N,L ),  where  N  is  the  set  of  nodes  of  the 

network,  and  L  C  N  x  N  is  the  set  of  links.  Given,  for  each  link  /,  a  rational 
number  pi,  0  <  pi  <  1,  where  pt  is  the  probability  that  link  l  is  operative.  Also 
given  is  a  rational  threshold  pth- 

QUESTION:  Are  there  two  simple  paths  Pi  and  P 2,  both  joining  node  s  to  node  t, 

such  that  the  probability  that  at  least  one  of  the  two  paths  is  operative  is  greater 
or  equal  than  pth- 

We  prove  that  this  is  an  iVP-hard  problem  by  considering  a  specific  topology, 
see  figure  4.1,  we  show  that  by  solving  the  problem  for  this  network  we  solve  the 
complementary  path  problem,  considered  in  chapter  two,  for  the  same  network; 
this  proves  that  the  problem  is  reduced  to  partition. 

Consider  the  network  of  figure  4.1.  This  is  the  same  network  as  in  figure  2.2. 
As  in  chapter  two  we  define  p;i  =  p?0;  this  implies  p, o  >  p,  1. 

i  j  1  i 

Given  the  tuple  (Pi,/^)  consisting  of  two  paths  that  connect  nodes  s  and  t, 


let  P(Pi,Pi)  be  the  probability  that  at  least  one  of  the  two  paths  is  operative.  A 
path  P  in  this  network  can  be  characterized  by  specifying  whether  P  includes  l°- 
or  /  •  for  j  =  1, ...  ,m.  To  a  given  path  P  we  now  associate  m  binary  variables 
PJ,j  =  1 .  ,m.  Where  P 3  =  0,  if  1°  G  P,  P;  =  1,  if  G  P.  We  prove  two  lemmas 
that  shall  then  be  used  to  prove  the  main  result. 

Lemma  4.1  If  (Pi,/^)  is  the  optimum  solution  for  the  two  non-disjoint  paths 
maximum  reliability  problem,  then  either  P[  =  0  or  P2  =  for  j  =  1 , . . . ,  m.  The 
‘or’  in  the  previous  statement  is  not  exclusive. 

Proof.  Suppose  this  is  not  true.  I.e.  for  some  j  we  have  that  P/  =  P2  =  1;  this 
means  that  the  link  /}  is  in  both  Pi  and  P2.  Therefore, 

p{Pi,P2)  =  ru+EU-  n  pi 

iePi  iePi  iePiuP2 

=  Pl\Pa  +  Pl\Pb  -  Pl\Pc,  (4.70) 

3  J  J 

for  some  0  <  pa,pb,pc  <  1.  If  we  now  make  P(  =  0,  changing  thus  Pi  into  P[  which 
includes  instead  of  /],  we  have 


PiP'l’Pz)  =  Pl°Pa  +  PP-Pb  ~  Pl°PpPc • 

3  3  3  3 


(4-71) 


Therefore, 


P(P[,  Pi)  ~  P(Pi,P2)  -  [pr  ~  Pp)Pa  +  (1  -  PpPpPc)  >  0. 


(4.72) 


Which  implies 


i 


P(P[,P2)  >  P{Pi,P2).  (4.73) 

Which  is  a  contradiction. 


I 


Lemma  4.2  In  an  optimum  solution  (Pi,  Pi)  one  of  the  two  paths,  say  P1(  is  such 
that  P[  =  0  for  j  =  1, . . . ,  m. 

Proof.  Suppose  that  the  tuple  (Pa,Pi)  is  an  optimum  solution,  and  that  for 
neither  of  these  two  paths  are  all  P-7  =  0.  From  the  previous  lemma  we  have  that 
Pi  =  Pb  —  1  cannot  occur  for  any  j.  Consider  now  the  pair  (Pl5P2)  defined  by 


P(  —  0  for  j  =  1, . . . ,  m 


Pi 


1  if  Pf  =  0  and  Pf  =  1; 
Pf  otherwise. 


This  means 


Pi  =  l,P/=0  =►  Pf  =  0,  Pf  =  1 

P’=0,p;  =  l  =>  P(  =  0,  Pi  =  1 

Pj  =  0.  Pi  =0  =>  Pi  =  0 .Pi  =  0.  14.741 


From  this  construction  we  have  that  P\  U  P?  =  Pa  U  Pi,',  we  also  have 


n  ^  n  ^  =  n  ^  n  p<- 

ieP i  leP?  iePa  feP6 


We  want  to  prove  that  P(PuPi)  >  P(Pa,Pb)-,  or  equivalently  that 


P(Pl,P2)~P(Pa,Pb)>0. 


We  have 


p(PuPi)-  p(Pa,Pb)  =  npi+ripi-  n  ^ 

i£Pi  ieP2  /6P1UP2 


-  ( n  ^ + n  P'  -  n  pi 

,iePa  iePb  lePaUPt 


n  Pi  +  n  Pi  -  n  Pi + n  Pi 

lePi  ieP2  Viepa  iePt  , 


Define  now 


xi  =  n  Pi>  z2 =  n 

(6P1  iep2 


=  n  Pi>  ^  =  n  Pi- 


(ePa 


ieP6 


Take,  without  loss  of  generality,  xa  >  x*.  We  also  have,  by  construction, 


(4.75) 


(4.76) 


(4.77) 


(4.78 

X]  >  x, 


and  Xi  >  X2 .  Define  c  =  xtX2  =  x„Xi.  Then, 


c  c 

^(xi,x2)  -  P(xa,xb)  =  (xi  H - )  —  (xa  H - ) 

Xi 

=  f(xi)-f(xa).  (4.79) 

Where  /(x)  =  x  +  ^.  For  x  >  >/c,  /(x)  is  an  increasing  function.  We  have  xy  >  x2 
and  c  —  xyx2,  and  this  guarantees  xy  >  -y/c.  Similarly  we  have  that  xa  >  \fc. 
Therefore 

>  a:a  =>  /( xi)  >  /(i0).  (4.80) 

Which  implies  P{xi,x2)  >  P(xa,z.). 

I 


We  can  now  prove 

Theorem  4.1  The  two  non-disjoint  path  maximum  reliability  problem  is  NP-complet 

Proof.  The  problem  is  easily  seen  to  be  in  NP,  i.e.  for  given  Pi  and  P2  it  can 
be  determined  polynomially  if  the  probability  that  at  least  one  of  the  two  paths 
is  operative  is  greater  or  equal  than  pth •  From  the  two  previous  lemmata  we  have 
that  we  can  always  take  Py  to  be  fixed,  namely  P(  =  1,  j  =  1, . . . ,  m.  The  problem 
is  then  reduced  to  the  particular  case  of  the  complementary  case  problem  that  was 
used  in  chapter  two  for  the  reduction  to  partition. 


We  also  have  the  following. 


Theorem  4.2  The  two  non-disjoint  path  maximum  reliability  problem  with  loga¬ 
rithmic  inputs  is  NP-hard. 

It  is  not  known  if  this  problem  is  in  NP. 


4.3  The  Algorithm 

In  this  section  we  present  a  pseudo-polynomial  algorithm  for  the  problem  described 
in  section  4.2.  The  algorithm  is  basically  dynamic  programming.  Given  G  = 
( N,L ),  a  directed,  acyclic,  stratified  network,  we  generate  G2  =  (jV2,L2),  the 
corresponding  squared  network  as  in  section  3.3.2,  with  the  difference  that  we  omit 
the  requirement  that  in  3.3.2  guarantees  path  disjointedness.  That  is,  if  there  is  a 
node  k  G  5,-,  and  a  node  l  G  S,+i,  with  a  link  (k,l)  G  L,  then,  in  G2  there  will  be 
a  link  { kk,ll ).  To  every  path  connecting  node  ss  to  node  tt  in  G2  correspond  two, 
not  necessarily  disjoint,  paths  in  G,  both  connecting  nodes  s  and  t. 

A  link  ( ik,rs )  G  G2  is  labeled  by  three  coefficients:  c|ijfer3)  =  n^tr)]  c2tjk  rj)  = 
n(k,,)\  and  c2ifc  rJ)  =  0,  unless  i  —  k  and  r  =  s,  in  which  case  c^kk3^  =  c^kk  = 

c}kk,33)  =  n<*,a>- 

Given  a  path  P  in  G2,  the  probability  that  at  least  one  of  the  two  corresponding 


paths,  Pi  and  P2  E  G,  is  operative  is  given  by 


P(Pi,P2 )  =  a^ieP!"' +  aEiep2n'  _  aEiePlni+Ei6p2ni  Ei6Plnp2 


aEiePc(J  +  aE,eP^_0E,eP^E(ePc?  Eigp^f.  (4-8i) 


Suppose  copy  one  of  the  message  is  sent  to  node  i  E  S’*  following  fixed  path 
P{.  Similarly,  copy  two  is  sent  to  node  j  G  5*,  following  fixed  path  P2.  Define 
now,  Tii  —  EieP;  nii  n2  =  E iep^nO  and  n3  =  YheP^nP^i-  Or  equivalently,  if  P' 
is  the  path  in  G 2  corresponding  to  P{  and  Pj,  nj  =  EieP' c?in2  =  EieP'  Cj2-,  and 
n3  =  Ejep'  cf. 

Let  P"  and  P^  be  the  paths  the  two  copies  should  follow  if  they  were  to  start 
from  nodes  i  and  j  to  node  t,  so  that  the  probability  that  at  least  one  of  them 
arrives  is  maximized.  Notice  that  the  choice  of  these  paths,  the  paths  the  two 
copies  would  follow  after  the  fc^h  iayer  if  they  were  to  make  it  that  far  before 
encountering  a  failed  link,  depends  on  P[  and  P2,  the  paths  along  which  the  copies 
were  routed  towards  the  layer.  For  example,  is  P[  is  much  more  reliable  than 
P2,  the  optimal  choice  of  P"  and  P2  would  typically  favor  P",  as  it  is  more  probable 
that  copy  one  has  actually  reached  the  k^  layer.  The  choice  of  these  paths  depends 
only  on  n\,  n2,  and  n3. 

Define 


^i(ni,n2,n3)  =  ]T  n(, 
ieP[' 
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v.v-v- 


^:{nun2,n3)  =  ni, 


v?j{ni,n2,n3)  =  n': 

teP"nP” 


(4.82) 


Define  7r,y  (n1}  n2)  rc3)  as  the  maximum  probability  that  at  least  one  of  these  two 
copies  reaches  node  t.  Then,  given  that  node  ij  £  S£,  we  have 


tt ■«(nljna,n3)  =  max  {  ani+c‘u.ko+^^(ni+cuJ.M>'^+c^O'n3+clJ.u)) 

(tj.fci)  6L2 

+  an»+ckifcC+^'(n‘+c(u.*oin2+cw.to-n3+,:<iy.fc«>) 

(ni+cW,U>+^l(ni+cW,Jt0,n3+cl;^0'"3+cl';,U)) 
+^+e<iiJkl>+v«(ni+<<.7.«>'^+e<uJk0,n*+e0i.H>) 
a  -(n3+4y.to+^(n‘+cw.*i>-n3+ckto-n3+cUtf>)))  }(.4.83) 


Let  mn  G  S|+1  be  such  that  ( ij,mn }  is  the  link  that  achieves  the  maximum  in 
4.83.  Then  we  have,  for  p  =  1,2,3 

(ni,n2,n3)  =  cp{ijmn)  f  i/*n(ni  +  c\ijimn),n2  +  c^.mn),n3  +  c\ljmn]).  (4.84) 

Equations  (4.83)  and  (4.84)  provide  the  means  to  calculate  the  optimal  routing 
for  two  copies  of  a  message.  The  maximum  of  P(Pi,P2),  as  in  equation  (3.36),  is 


given  by 


P  {Pi,  P2)  =7T„  (0,0,0). 


/*  »’*  L.V  »"* 'V 


(4.85) 


t 

i 

The  paths  can  be  obtained  by  recording  the  decisions  taken  when  using  equation  j 

i 

(4.84).  Notice  that  j 

l 

\ 

n,a{nun2,n3)  =  an'  +  an*  -  ani+n*~n\  I 


"L(niin2,n3)  =  rii,  i  =  1,2,3.  (4.86) 


We  next  look  at  the  performance  of  this  algorithm.  If  M*  is  an  upper  bound 
on  the  maximum  length  a  path  can  have  from  node  ss  to  a  node  in  Sk,  where  the 
length  of  a  link  /  is  given  by  nj,  if  we  know  the  z/£  (ni,  n2,  n3)  for  n  =  1,2,3,  and 
0  <  ni,n2,n3  <  Mfc+1,  and  i,j  €  i.e.  ij  £  S%+1,  then  it  is  possible  to  calculate 

the  r,"(n1,n2,n3)  for  n  =  1,2,3;  and  0  <  ni,n2,n3  <  M*;  ij  £  S%. 

This  dynamic  programming  solution  of  the  problem  presents  the  usual  problem 
of  state  space  explosion.  Any  attempt  at  using  it  for  practical  application  would 
require  careful  programming.  In  particular,  one  way  to  optimize  the  performance 
of  this  approach  is  to  make  the  Mk  as  tight  as  possible.  Notice  that  this  approach 
could  also  be  used  for  the  case  of  a  link-disjoint  path  as  in  section  3.2.  Also,  by  a 
simple  device  as  in  the  introduction,  we  can  treat  the  case  of  node-disjoint  paths. 
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Chapter  5 


Conclusions  and 
Recommendations  for  Further 
Research 


In  this  thesis  we  have  presented  several  ways  of  selecting  two  s-£-paths  for  simulta¬ 
neous  transmission  of  two  copies  of  one  message  in  a  data  communication  network. 
In  section  5.1  we  compare  how  these  approaches  perform  in  practice.  In  section  5.2 
we  look  at  suggestions  for  further  research. 


5.1  Comparing  Approaches 


We  have  chosen  three  of  the  route  selection  procedures,  all  of  them  using  loga¬ 
rithmic  input.  These  three  have  been  selected  because  they  can  be  implemented 


efficiently: 


•  Complementary  Path  Algorithm,  chapter  2,  (Compath). 

•  Finding  two  disjoint  s-£-paths  Pi  and  P2  such  that  the  probability  that  both 
of  them  (Pi  and  P2)  are  operative  is  maximized,  (Twopathand). 

•  Finding  two  disjoint  s-f-paths  Pi  and  P2  such  that  the  probability  that  at 
least  one  of  them  (Pi  or  P2)  is  operative  is  maximized,  (Twopathor). 

These  three  methods  have  been  programmed.  We  have  some  limited  experience 
comparing  their  performance  in  actual  examples.  We  next  comment  on  each  of 
them  separately  and  then  compare  their  behaviour. 

Compath:  The  algorithm  given  in  chapter  two  for  this  problem  works  on  unre¬ 
stricted  networks,  it  is  pseudopolynomial.  But,  if  for  every  link  l  we  take  nt  <  n, 
where  pi  =  a"',  and  4n4  is  a  positive  integer,  then  the  algorithm  becomes  polyno¬ 
mial  in  \L\  and  |JVj.  Specifically,  it  takes  0{n\L\ 2)  operations.  For  a  given  input 
network,  given  basic  path  Pi,  given  the  nC s,  and  given  the  basic  operational  prob¬ 
ability  a  the  solution  is  dependent  on  the  value  of  this  o,  as  follows.  The  closest 
a  is  to  one  the  more  it  is  likely  that  P2,  the  complementary  path,  be  disjoint  from 
Pi.  The  closest  a  is  to  zero  the  more  likely  it  is  that  P\  and  P2  will  share  links. 

Twopathand:  the  algorithm  given  in  [16]  consists  of  two  consecutive  applications 
of  Dijkstra’s  shortest  path  method  with  the  addition  of  an  0(|L|)  step  between 
these  two  applications.  Therefore  this  is  a  polynomial  procedure.  It  works  on  un¬ 
restricted  networks.  The  solution  is  independent  of  the  basic  operative  probability 


a.  As  a  by-product  we  obtain  a  shortest  path  of  the  network  which  can  be  used  as 
part  of  the  input,  for  Compath,  the  fixed  path  (Pi). 


Twopathor:  the  pseudopolynomial  algorithm  given  in  chapter  three  applies  only 
to  acyclic  networks.  The  algorithm  works  by  finding  all  pairs  of  non-dominated  ex¬ 
treme  disjoint  s-f-paths  (Pi,P2).  Once  this  set  has  been  found  it  can  be  calculated 
which  pair  attains  the  optimum  for  the  given  value  of  a. 

To  test  the  comparative  performance  of  these  algorithms  we  generated  acyclic 
networks  with  random  values  of  n;,  pi  =  an',  for  the  links  of  the  network,  ni 
takes  the  values  1,2, 3, 4, 5  with  equal  probability.  The  generated  networks  are 
characterized  by  the  number  of  nodes  they  have,  and  by  a  parameter  called  width , 
this  parameter  is  such  that  there  exists  a  link  connecting  node  i  to  node  j  if  and 
only  if  i  <  j  <  j  -f-  width.  Here  we  present  result  for  six  networks: 


netl:  consisting  of  10  nodes,  with  a  width  of  3. 


•  net2:  consisting  of  15  nodes,  with  a  width  of  4. 


net3:  consisting  of  20  nodes,  with  a  width  of  4. 


net4:  consisting  of  25  nodes,  with  a  width  of  4. 


net5:  consisting  of  40  nodes,  with  a  width  of  5. 


•  net6:  consisting  of  30  nodes,  with  a  width  of  6. 


Tables  5.1,  5.2  and  5.3  present  the  value  of  the  probability  that  at  least  one  of 
the  two  messages  arrives  at  the  destination  node  for  the  pair  of  paths  provided  by 


m  •  m  >  -<  •  *  »  •  » 

"Cvl-.v.w.v  • 


v*  •  *  V*  «v  O  V  -V  V* 


A  *•  %  ’•  _*»  A 


ill 


a=0.99  Twopathand 

Twopathor 

Compath 

netl  .9910 

.9910 

.9910 

net2  .9909 

.9909 

.9909 

net3  .9905 

.9905 

.9905 

net4  .9756 

.9756 

.9756 

net5  .9754 

.9754 

.9754 

net6  .9905 

.9905 

.9905 

Table  5.1:  Message  arrival  probability  for  a  =  0.99 


the  algorithms  indicated  in  the  columns.  The  source  node  is  node  number  1,  the 
destination  node  is  the  highest  numbered  node  of  the  network.  The  three  tables 
correspond  to  different  values  of  the  basic  probability  a.  We  use  a  most  reliable 
path  of  the  network  as  the  fixed  path  in  the  input  for  algorithm  Compath. 

Notice  that  even  though  the  paths  found  by  algorithm  Twopathand  maximize 
the  probability  that  both  path  are  operative,  the  value  shown  in  the  tables  is  the 
probability  that  at  least  one  of  them  is  operative. 

Our  computational  experience  suggests  that  the  solutions  obtained  by  the  al¬ 
gorithm  Twopathand  are  very  close  to  those  given  by  algorithm  Twopathor,  in  the 
cases  shown  here  they  happen  to  be  always  the  same.  This  can  be  explained  by  re¬ 
calling  that  the  solution  given  by  algorithm  Twopathor  is  always  a  non-dominated 
extreme  pair  of  link-disjoint  paths  of  the  network,  which  is  also  the  case  for  al¬ 
gorithm  Twopathand,  though  not  necessarily  the  same  pair,  if  the  network  is  such 


that  there  exists  only  one  non-dominated  extreme  pair  of  link-disjoint  paths  the 
solution  provided  by  these  two  algorithms  is  necessarily  the  same.  From  the  exam¬ 
ples  here  presented  one  could  infer  that  the  family  of  generated  networks  has  this 
property  for  sufficiently  large  values  of  the  number  of  nodes  and  the  width  of  the 
networks. 

The  solutions  given  by  algorithm  Twopathand  are  also  close  to  those  given  by 
algorithm  Compath,  particulary  when  a  is  close  to  one,  see  the  tables  for  a  =  0.99. 
When  the  value  of  a  decreases,  see  table  5.3,  algorithm  Compath  does  produce 
better  solutions.  This  suggests  that  in  this  case,  ‘low’  o,  sharing  links  is  necessary 
to  improve  the  probability  that  at  least  one  of  the  two  copies  of  the  message  arrives 
at  the  destination  node. 

The  numerical  results  obtained  indicate  that  the  possible  improvements  to  be 
gained  by  using  algorithm  Twopathor  might  not  justify  the  increased  computational 
expense.  For  a  close  to  one  the  solutions  obtained  by  algorithm  Twopathand  art 
very  close  to  that  obtained  by  algorithm  Compath  when,  as  here,  a  most  readable 
path  of  the  network  is  used  as  the  fixed  path  (Pi). 

5.2  Suggestions  for  Further  Research 

•  Some  complexity  issues  remain  unanswered: 

-  Is  the  problem  treated  in  chapter  three  (finding  two  disjoint  s-£-paths 
that  maximize  the  probability  that  at  least  on  of  them  is  operative) 
iVP-complete  when  the  network  is  acyclic? 


•*“  -A  /  /  •  .*  \  . 
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-  Is  the  problem  treated  in  chapter  four  (finding  two  not  necessarily  dis¬ 
joint  s-t-paths  that  maximize  the  probability  that  at  least  one  of  them 
is  operative)  strongly  IVP-complete  for  an  unrestricted  network,  i.e.  a 
network  that  may  contain  cycles. 

From  a  more  general  perspective  the  subjetct  of  “robustness”  for  algorithms 
in  data-communication  networks  seems  to  be  open.  The  design  of  flow-control 
and  routing  algorithms  that  will  behave  reasonably  in  the  presence  of  unre¬ 
liable  input  data  seems  to  be  deserving  of  attention. 
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